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PURIFIED THERMOSTABLE NUCLEIC ACID POLYMERASE 
ENZYME FROM THERMOTOGA MARTTIMA 

Technical Field 

The. present invention relates to a purified, thermostable DNA polymerase 
5 purified from tJie hyperthermophilic eubacteria Thermotoga maritima and means for 
isolating and producing the enzyme. Themiostable DNA polymerases are useful in 
many recombinant DNA techniques, especially nucleic acid amplification by the 
polymerase chain reaction (PGR). 

Background Art 

10 In Huber £t al., 1986, Arch , Microbiol . 144:324-333, the isolation of the 

bacterium Themiotog a maritima is described. T, maritima is a eubacterium that is 
stricdy anaerobic, rod-shaped, fermentative, hypenhermophilic, and grows between 
55*C and 90'C, with an optimum growth temperature of about 80*C. This eubacterium 
has been isolated from geoihenmally heate^J sea floors in Italy and the Azores. X 

15 maritima cells have a sheaih-likc structure and monotrichous flagellation. I. maritima 
is classified in the eubacterial kingdom by vinue of having murein and fatty acid- 
containing lipids, diphtheria-ioxin-rcsistant elongation factor 2, an RNA polymerase 
subunit pattern, and sensitivity to antibiodcs. 

Extensive research has been conducted on the isolation of DNA polymerases 

20 from mesophilic microorganisms such as £. coli . See, for example, Bessman £i al*. 
1957, 1 Biol . Chem . 221'M\-\11. and Buttin and Komberg, 1966, 1. EM. Chcm . 
241 :5419-5427. Much less investigation has been made on the isolation and 
purification of DNA polymerases from thermophiles such as Themiotoga maritima . In 
Kaledin et al-, 1980, Biokhymiya 4^:644-65 1 , a six-step isolation and enrichment 

25 procedure for DNA polymerase activity fix)m cells of a Thermus aquaricus YT- 1 strain 
is disclosed. These steps involve isolation of crude extract, DEAE-cellulose 
chromatography, fractionation on hydroxyapatite, fractionation on DEAE-cellulose, and 
chromatography on single-strand DN A-cellulose. The molecular weight of the puritled 
enzyme is reponed by Kaledin £i al. as 62.000 daltons per monomeric unit. 

30 A second enrichment scheme for a polymerase from Thermus aquaticus is 

described in Chien et al-, 1976, ;_. Bacteriol . 127:1550-1557. In this process, the 
crude extract is applied to a DEAE-Sephadex column. The dialyzed pooled fractions 
are then subjected to treatment on a phosphocellulose column. The pooled fractions are 
dialyzed, and bovine serum albumin (BSA) is added to prevent loss of polymerase 

35 activity. The resulting mixture is loaded on a DNA-cellulose column. The pooled 
material from the column is dialyzed. The molecular weight of the purified protein is 
reponed to be about 63,000 daltons to 68,000 daltons. 



wo 92/03556 



PCT/tlS91/057S3 



2 

The use of thermosiable enzymes, such as those described in Chien si 31- and 
Kaledin to amplify existing nucleic acid sequences in amounts that are large 
compared to the amount initially present is described in U.S. Patent Nos. 4,683,195; 
4,683,202; and 4,965,188, which describe the PGR process, each of which is 

5 incorporated herein by reference. Primers, template, nucleoside triphosphates, the 
appropriate buffer and reaction conditions, and polymerase are used in the PGR 
process, which involves denaturation of target DNA. hybridization of primers, and 
synthesis of complementary strands. The extension product of each primer becomes a 
template for the production of the desired nucleic acid sequence. The patents disclose 

10 that, if the polymerase employed is a thermostable enzyme, then polymerase need not 
be added after every denaturation step, because heat will not destroy the polymerase 
activity. 

U.S. Patent No. 4,889,818. European Patent Publication No. 258,017, and 
PCT Publication No. 89/06691 , the disclosures of which are incorporated herein by 

1 5 reference, describe the isolation and recombinant expression of an -94 kDa 

thermostable DNA polymerase from Thcrmus aquancus and die use of that polymerase 
in PGR. Although I. aquaticus DNA polymerase is especially preferred for use in 
PGR and otiier recombinant DNA techniques, there remains a need for other 
thermostable polymerases. 

20 Accordingly, there is a desire in tiie an to produce a purified, thermostable DNA 

polymerase that may be used to improve the PGR process described above and to 
improve the results obtained when using a thermostable DNA polymerase in other 
recombinant techniques, such as DNA sequencing, nick-translation, and even reverse 
transcription. The present invention helps meet that need by providing recombinant 

25 expression vectors and purification protocols for Thg. rmQtQ ga maririma DNA 
polymerase. 

Disclosure of Invention 

The present invention pro\'ides a purified thermostable DNA pol>'merase i 
enzyme tiiat catalyzes com.bination of nucleoside triphosphates to form a nucleic acid 

30 strand complementary to a nucleic acid template strand. The purified enzyme is the 
DNA polymerase I from Thermotoga maririma (Tma ) and has a molecular weight of 
about 97 kilodaltons (kDa) as measured by SDS-PAGE and an inferred molecular 
weight, from the nucleotide sequence of the Ima DNA polymerase gene, of 102 kDa. 
This purified material may be used in PGR to produce a given nucleic acid sequence in 

35 amounts that are large compared to the amount initially present so that the sequences 
can be manipulated and/or analyzed easily. 

The gene encoding Tma DNA polymerase enzyme from Thenmotoga maritima 
has also been identified, cloned, sequenced, and expressed at high level and provides 
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yet another means to prepare the thermostable enzyme of the present invention. In 
addition to the intact gene and the coding sequence for the Tma enzyme, derivatives of 
the coding sequence for Tma DNA polymerase are also provided. 

The invention also encompasses a stable enzyme composition comprising a 
5 purified, thermostable Tma enzyme as described above in a buffer containing one or 
more non-ionic polymeric detergents. 

Finally, the invention provides a method of purification for the thermostable 
polymerase of the invention. This method involves preparing a crude extract from 
ThgmiQtog^ maririma cells, adjusting the ionic strength of the crude extract so that the 
10 DNA polymerase dissociates from nucleic acid in tiie extract, subjecting the extract to 
hydrophobic interaction chromatography, subjecting tiie extract to DNA binding protein 
affinity chromatography, and subjecting the extract to cation or anion exchange or 
hydroxyapatite chromatography. In a preferred embodiment, these steps are performed 
sequentially in the order given above. The nucleotide binding protein affinity 
15 chromatography step is preferred for separating the DNA polymerase from 
endonuclease proteins. 

Modes for Carrying out the Invention 

The present invention provides DNA sequences and expression vectors that 
encode Tma DNA polymerase, purification protocols for Tma DNA polymerase, 
20 preparations of purified Tma DNA polymerase, and methods for using Tma DNA 

polymerase. To facilitate understanding ol Ae invention, a number of terms are defined 
below. 

The temis "cell," "cell line," and "cell culture" are used interchangeably and all 
such designations include progeny. Thus, the words "transfomiants" or "transfomied 

25 cells" include the primary u^sformed cell and cultures derived fix»m that cell without 
regard to the number of transfers. All progeny may not be precisely identical in DNA 
- content, due to deliberate or inadvertent mutations. Mutant progeny that have the same 
functionality as screened for in the originally transformed cell are included in the 
definition of transformanis. 

30 The term "control sequences" refers to DNA sequences necessary for the 

expression of an operably linked coding sequence in a particular host organism. The 
control sequences that are suitable for procaryoies, for exam: . include a promoter, 
optionally an operator sequence, a ribosome binding site, and possibly other 
sequences, such as transcription temiination sequences. Eucaryotic cells are known to 

35 utilize promoters, polyadenylation signals, and enhancers. 

The term "expression system" refers to DNA sequences containing a desired 
coding sequence and control sequences in operable linkage, so that hosts transfomied 
with these sequences are capable of producing the encoded proteins. To effect 



wo 92/03556 



PCr/US91/05753 



transformation, the expression system may be included on a vector; the relevant DNA 
can also be integrated into the host chromosome. 

The term "gene" refers to a DNA sequence that codes for the expression of a 
recoverable bioactive polypeptide or precursor. Thus, the Ima DNA polymerase gene 
5 includes die promoter and Ima DNA polymerase coding sequence. The polypeptide 
can be encoded by a full-length coding sequence or by any portion of the coding 
sequence so long as the desired enzymatic activity is retained. 

The term "operably Unked" refers to the positioning of die coding sequence 
such Uiat control sequences wUl function to drive expression of the encoded protein. 
10 Thus, a coding sequence "operably Unked" to a control sequence refers to a 

configuration wherein the coding sequence can be expressed under the direction of the 
control sequence. 

The term "mixture" as it relates to mixtures containing Ima polymerase refers to 
a coUection of materials that includes Ima polymerase but can also include other 
15 proteins. If Uie Ima polymerase is derived from recombinant host ceUs, the other 
proteins will ordinarily be those associated with the host. Where the host is bacterial, 
the contantinating proteins will be bacterial proteins. 

Tlie term, "non-ionic polymeric detergents" refers to surface-active agents that 
have no ionic charge and that are characterized, for purposes of this invention, by an 
20 ability to stabiUze the Jm enzyme at a pH range of from about 3.5 to about 9.5. 
preferably from 4 to 8.5. Numerous examples of suitable non-ionic polymeric 
detergents are presented in copending U.S. patent application Serial No. 387,003, filed 
July 28, 1989, the disclosure of which is incorporated herein by reference. 

The term "oligonucleotide" as used herein is defined as a molecule comprised of 
25 two or more deoxyribonucleotides or ribonucleotides, preferably more tiian three, and 
usually more than ten. The exact size will depend on many factors, which in turn 
depend on the ultimate function or use of the oligonucleotide. The oligonucleotide may 
be derived synthetically or by" cloning. 

The term "primer" as used herein refers to an oligonucleotide that is capable of 
30 acting as a point of initiation of synthesis when placed under conditions in which 

primer extension is initiated. An oligonucleotide "primer" may occur naturally, as in a 
purified restriction digest, or be produced synthetically. Synthesis of a primer 
extension product that is complementary to a nucleic acid strand is initiated in the 
presence of four different nucleoside triphosphates and the Ima thermostable enzyme in 
35 an appropriate buffer at a suitable temperature. A "buffer" includes cofactors (such as 
divalent metal ions) and salt (to provide the appropriate ionic strength), adjusted to the 
desired pH. For Ima polymerase, the buffer preferably contains 1 to 3 mM of a 
magnesium salt, preferably MgCl2, 50 to 200 of each nucleoside triphosphate, and 
0.2 to 1 \iM of each primer, along with 50 mM KCl, 10 mM Tris buffer (pH 8.0-8.4), 
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and 100 [ig/ml gelatin (although gelatin is not required and should be avoided in some 
applications, such as DNA sequencing). 

The primer is single-stranded for maximum efficiency in amplification, but may 
alternatively be double- stranded. If double-stranded, the primer is first treated to 
5 separate its strands before being used to prepare extension products. The primer is 
usually an oligodeoxyribonucleoride. The primer must be sufficiently long to prime the 
synthesis of extension products in the presence of the polymerase enzyme. The exact 
length of a primer will depend on many factors, such as source of primer and result 
desired, and the reaction temperature must be adjusted depending on primer length to 
10 ensure proper annealing of primer to template. Depending on the complexity of the 
target sequence, the oligonucleotide primer typically contains 15 to 35 nucleotides. 
Short primer molecules generally require cooler temperatures to form sufficienUy stable 
complexes with template. 

A primer is selected to be "substantially" complementary to a strand of specific 
15 sequence of the template. A primer must be sufficiently complementary to hybridize 
with a template strand for primer elongation to occur. A primer sequence need not 
reflect the exact sequence of the template. For example, a non-complementary 
nucleotide fragment may be attached to the 5* end of the primer, with the remainder of 
Uie primer sequence being substantially complementary to the strand. Non- 
20 complementary bases or longer sequences can be interspersed into the primer, provided 
that the primer sequence has sufficient complementarity with the sequence of the 
template to hybridize and thereby form a template/primer complex for synthesis of the 
extension product of the primer. 

The tenms "restriction endonucleases" and "restriction enzymes" refer to 
25 bacterial enzymes that cut double-siranded DNA at or near a specific nucleotide 
sequence. 

The term "thermostable enzyme" refers to an enzyme which is stable to heat and 
is heat resistant and catalyzes (facilitates) combination of the nucleotides in the" proper 
manner to form primer extension products that are complementary lo a nucleic acid 

30 strand. Generally, synthesis of a primer extension product begins at the 3* end of the 
primer and proceeds towards the 5' end of the template strand until synthesis 
terminates. A thermostable enzyme must be able to renamre and regain activity after 
brief (i.e., 5 to 30 seconds) exposure to temperatures of 80'C to 105'C and must have 
a temperature optimum of above 60*C. 

35 The Tma thermostable DNA polymerase enzyme of the present invention 

satisfies the requirements for effective use in the amplification reaction known as the 
polymerase chain reaction or PGR. The Tma DNA polymerase enzyme does not 
become irreversibly denatured (inactivated) when subjected to Uie elevated temperatures 
for the rime necessary to effect denaturauon of double-stranded nucleic acids, a key step 
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in the PGR process. Irreversible denaturation of an enzyme for purposes herein refers 
to permanent and complete loss of enzymatic activity. 

The heating conditions necessary to effect nucleic acid denaturation will depend, 
e.g., on the buffer salt concentration and the composition, length, and amount of the 
5 nucleic acids being denatured, but typically the denaturation temperature ranges from 
about 80'C to about 105'C for a few seconds to minutes. Higher temperatures may be 
required for nucleic acid denaturation as the buffer salt concentration and/or GC 
composition of the nucleic acid is increased. The Tnm enzyme does not become 

irreversibly denatured upon relatively short exposures to temperatures of about 80'C- 
10 105*C. 

The Iim thermostable enzyme has an optimum temperature at which it 
functions that is higher than about 60'C. Temperatures below 60'C faciUtate 
hybridization of primer to template, but depending on salt composition and 
concentration and primer composition and length, hybridization of primer to template 

15 can occur at higher temperatures (e.g., eO'C-gO'C), which may promote specificity of 
the primer elongation reaction. The higher the temperature optimum for the enzyme, 
the greater the specificity and/or selectivity of the primer-directed extension process. 
The Trrm enzyme exhibits activity over a broad temperature range from about 45*C to 
90*C; a preferred optimum temperature is 75*C-80*C. 

20 The present invention also provides DNA sequences encoding the thermostable 

DNA polymerase I activity nfTh^rmntnpa maririma . The amino acid sequence encoded 
by this sequence has homology to portions of the thermostable DNA polymerases of 
Thermus aquaticus and Themius thenmophilus . The complete coding sequence, from 
the 5'-ATG Stan codon to the TGA-3' stop codon, of the Jma DNA polymerase gene is 
25 depicted below and listed as SEQ ID NO: 1 in the sequence listing section. The 
sequence is numbered for reference. 

1 ATGGCGAGAC TATTTCTCTT TGATGGAACT GCTCTGGCCT ACAGAGCGTA 

51 CTATGCGCTC GATAGATCGC TTTCTACTTC CACCGGCATT CCCACAAACG 

101 CCACATACGG TGTGGCGAGG ATGCTGGTGA GATTCATCAA AGACCATATC 

30 151 ATTGTCGGAA AAGACTACGT TGCTGTGGCT TTCGACAAAA AAGCTGCCAC 

201 CTTCAGACAC AAGCTCCTCG AGACTTACAA GGCTCAAAGA CCAAAGACTC 

251 CGGATCTCCT GATTCAGCAG CTTCCGTACA TAAAGAAGCT GGTCGAAGCC 

301 CTTGGAATGA AAGTGCTGGA GGTAGAAGGA TACGAAGCGG ACGATATAAT 

351 TGCCACTCTG GCTGTGAAGG GGCTTCCGCT TTTTGATGAA ATATTCATAG 

35 401 TGACCGGAGA TAAAGACATG CTTCAGCTTG TGAACGAAAA GATCAAGGTG 

451 TGGCGAATCG TAAAAGGGAT ATCCGATCTG GAACTTTACG ATGCGCAGAA 

501 GGTGAAGGAA AAATACGGTG TTGAACCCCA GCAGATCCCG GATCTTCTGG 

551 CTCTAACCGG AGATGAAATA GACAACATCC CCGGTGTAAC TGGGATAGGT 

601 GAAAAGACTG CTGTTCAGCT TCTAGAGAAG TACAAAGACC TCGAAGACAT 
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10 



15 



20 



25 



30 



35 



651 


ACTGAATCAT 


GTTCGCGAAC 


TTCCTCAAAA 


GGTGAGAAAA 


GCCCTGCTTC 


701 


GAGACAGAGA 


AAACGCCATT 


CTCAGCAAAA 


AGCTGGCGAT 


TCTGGAAACA 


751 


AACGTTCCCA 


TTGAAATAAA 


CTGGGAAGAA 


CTTCGCTACC 


AGGGCTACGA 


801 


CAGAGAGAAA 


CTCTTACCAC 


TTTTGAAAGA 


ACTGGAATTC 


GCATCCATCA 


851 


TGAAGGAACT 


TCAACTGTAC 


GAAGAGTCCG 


AACCCGTTGG 


ATACAGAATA 


901 


GTGAAAGACC 


TAGTGGAATT 


TGAAAAACTC 


ATAGAGAAAC 


TGAGAGAATC 


951 


CCCTTCGTTC 


GCCATAGATC 


TTGAGACGTC 


TTCCCTCGAT 


CCTTTCGACT 


1001 


GCGACATTGT 


CGGTATCTCT 


GTGTCTTTCA 


AACCAAAGGA 


AGCGTACTAC 


1051 


ATACCACTCC 


ATCATAGAAA 


CGCCCAGAAC 


CTGGACGAAA 


AAGAGGTTCT 


1101 


GAAAAAGCTC 


AAAGAAATTC 


TGGAGGACCC 


CGGAGCAAAG 


ATCGTTGGTC 


1151 


AGAATTTGAA 


ATTCGATTAC 


AAGGTGTTGA 


TGGTGAAGGG 


TGTTGAACCT 


1201 


GTTCCTCCTT 


ACTTCGACAC 


GATGATAGCG 


GCTTACCTTC 


TTGAGCCGAA 


1251 


CGAAAAGAAG 


TTCAATCTGG 


ACGATCTCGC 


ATTGAAATTT 


CTTGGATACA 


1301 


AAATGACATC 


TTACCAAGAG 


CTCATGTCCT 


TCTCTTTTCC 


GCTGTTTGGT 


1351 


TTCAGTTTTG 


CCGATGTTCC 


TGTAGAAAAA 


GCAGCGAACT 


ACTCCTGTGA 


1401 


AGATGCAGAC 


ATCACCTACA 


GACTTTACAA 


GACCCTGAGC 


TTAAAACTCC 


1451 


ACGAGGCAGA 


TCTGGAAAAC 


GTGTTCTACA 


AGATAGAAAT 


GCCCCTTGTG 


1501 


AACGTGCTTG 


CACGGATGGA 


ACTGAACGGT 


GTGTATGTGG 


ACACAGAGTT 


1551 


CCTGAAGAAA 


CTCTCAGAAG 


AGTACGGAAA 


AAAACTCGAA 


GAACTGGCAG 


1601 


AGGAAATATA 


CAGGATAGCT 


GGAGAGCCGT 


TCAACATAAJV 


CTCACCGAAG 


1651 


CAGGTTTCAA 


GGATCCTTTT 


TGAAAAACTC 


GGCATAAAAC 


CACGTGGTAA 


1701 


AACGACGAAA 


ACGGGAGACT 


ATTCAACACG 


CATAGAAGTC 


CTCGAGGAAC 


1751 


TTGCCGGTGA 


ACACGAAATC 


ATTCCTCTGA 


TTCTTGAATA 


CAGAAAGATA 


1801 


CAGAAATTGA 


AATCAACCTA 


CATAGACGCT 


CTTCCCAAGA 


TGGTCAACCC 


1851 


AAAGACCGGA 


AGGATTCATG 


CTTCTTTCAA 


TCAAACGGGG 


ACTGCCACTiS 


1901 


GAAGACTTAG 


CAGCAGCGAT 


CCCAATCTTC 


AGAACCTCCC 


GACGAAAAGT 


1951 


GAAGAGGGAA 


AAGAAATCAG 


GAAAGCGATA 


GTTCCTCAGG 


ATCCAAACTG 


2001 


GTGGATCGTC 


AGTGCCGACT 


ACTCCCAAAT 


AGAACTGAGG 


ATCCTCGCCC 


2051 


ATCTCAGTGG 


TGATGAGAAT 


CTTTTGAGGG 


CATTCGAAGA 


GGGCATCGAC 


2101 


GTCCACACTC 


l.'VACAGCTTC 


CAGAATATTC 


AACGTGAAAC 


CCGAAGAAGT 


2151 


AACCGAAGAA 


ATGCGCCGCG 


CTGGTAAAAT 


GGTTAATTTT 


TCCATCATAT 


2201 


ACGGTGTAAC 


ACCTTACGGT 


CTGTCTGTGA 


GGCTTGGAGT 


ACCTGTGAAA 


2251 


GAAGCAGAAA 


AG ATGATCGT 


CAACTACTTC 


GTCCTCTACC 


CAAAGGTGCG 


2301 


CGATTACATT 


CAGAGGGTCG 


TATCGGAAGC 


GAAAGAAAAA 


GGCTATGTTA 


2351 


GAACGCTGTT 


TGGAAGAAAA 


AGAGACATAC 


CACAGCTCAT 


GGCCCGGGAC 


2401 


AGGAACACAC 


AGGCTGAAGG 


AGAACGAATT 


GCCATAAACA 


CTCCCATACA 


2451 


GGGTACAGCA 


GCGGATATAA 


TAAAGCTGGC 


TATGATAGAA 


ATAGACAGGG 


2501 


AACTGAAAGA AAGAAAAATG 


AGATCGAAGA 


TGATCATACA 


GGTCCACGAC 


2551 


GAACTGGTTT 


TTGAAGTGCC 


CAATGAGGAA 


AAGGACGCGC 


TCGTCGAGCT 
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2601 GGTGAAAGAC AGAATGACGA ATGTGGTA?A GCTTTCAGTG CCGCTCGAAG 
2651 TGGATGTAAC CATCGGCAAA ACATGGTCGT GA 

Both the complete coding sequence of the Ima DNA polymerase gene and the encoded 
amino acid sequence in three letter abbreviation are provided in the Sequence Listing 
section az SEQ ID NO: 1 . For convenience, the amino acid sequence encoded by the 
Tma DNA polymerase gene sequence is also depicted below in one letter abbreviation 
from amino-terminus to carboxy-terminus; the sequence is numbered for reference. 





T 
X 


MARL F LF D GT 


ALAYRAYYAL 


DRSLSTSTGI 


PTNATYGVAR 


MLVRFIKDHI 




SI 


TVGKDYVAVA 


FDKKAATFRH 


KLLETYKAQR 


PKTPDLLIQQ 


LPYIKKLVEA 


in 




T.f^MKVLEVEG 


YEADDIIATL 


AVKGLPLFDE 


IFIVTGDKDM 


LQLVNEKIKV 




151 


WRIVKGISDL 


ELYDAQKVKE 


KYGVEPQQIP 


DLLALTGDEI 


DNIPGVTGIG 




201 


EKTAVOLLEK 


YKDLEDILNH 


VRELPQKVRK 


ALLRDRENAI 


LSKKLAILET 




251 


NVPIEINWEE 


LRYQGYDREK 


LLPLLKELEF 


ASIMKELQLY 


EESEPVGYRI 




301 


VKDLVEFEKL 


lEKLRESPSF 


AIDLETSSLD 


PFDCDIVGIS 


VSFKPKEAYY 


15 


351 


IPLHHRNAQW 


LDEKEVLKKL 


KEILEDPGAK 


IVGQNLKFDY 


KVLMVKGVEP 




401 


VPPYFDTMIA 


AYLLEPNEKK 


FNLDDLALKF 


LGYICMTSYQE 


LMSFSFPLFG 




451 


FSFADVPVEK 


AANYSCEDAD 


ITYRLYKTLS 


LKLHEADLEN 


VFYKIEMPLV 




501 


NVLARMELNG 


VYVDTEFLKK 


LSEEYGKKLE 


ELAEEIYRIA 


GEPFNINSPK 




551 


QVSRILFEKL 


GIKPRGKTTK 


TGDYSTRIEV 


LEELAGEHEI 


IPLILEYRKI 


20 


601 


QKLKSTYIDA 


LPKl^VNPKTG 


RIHASFNQTG 


TATGRLSSSD 


PNLQNLPTKS 




651 


EEGKEIRKAI 


VPQDPNWWIV 


SADYSQIELR 


ILAHLSGDEN 


LLRAFEEGID 




701 


VHTLTASRIF 


NVKPEEVTEE 


t^RRAGKIWNF 


SIIYGVTPYG 


LSVRLGVPVK 




751 


EAEKMIVNYF 


VLYPKVRDYI 


QRWSEAKEK 


GYVRTLFGRK 


RDIPQLMARD 




801 


RNTQAEGERI 


AINTPIQGTA 


ADIIKLAMIE 


IDRELKERKM 


RSKMIIQVHD 


25 


851 


ELVFEVPNEE 


KDALVELVKD 


RI^TNVVKLSV 


PLEVDVTIGK 


TWS 



The one letter abbreviations for the amino acids are shown below for convenience. 

F = Phenylalanine H = Histidine 

L = Uucine Q = Glutamine 

I = Isoleucine N = Asparagu.e 

30 M = Methionine K = Lysine 

V = Valine D = AsparticAcid 
S = Serine E = Glutamic Acid 
P = Proline C = Cysteine 

T = Threonine W = Tryptophan 

35 A = Alanine R = Arginine 

Y = Tyrosine G . = Glycine 
The coding sequence for Tma DNA polymerase I was identified by a 

"degenerate primer" method that has broad utility and is an important aspect of the 
present invention. In the degenerate primer method, DNA fragments of any 
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thermostable polymerase coding sequence corresponding to conserved domains of 
known thermostable DNA polymerases can be identified. 

In one embodiment of the degenerate primer method, the corresponding 
conserved domains are from the coding sequences for and amino acid sequences of the 

5 thermostable DNA polymerases of Tma. and Iih . The degenerate primer method 
was developed by comparing the amino acid sequences of DNA polymerase I proteins 
from Tag . Tth . T7, and E. eeli in which various conserved regions were identified. 
Primers corresponding to these conserved regions were then designed. As a result of 
the present invention, Tma sequences can be used to design other degenerate primers. 

10 The generic utility of the degenerate primer process is exemplified herein by specific 
references to the method as applied to cloning the Tma gene. 

To clone the Tma DNA polymerase I gene, the conserved amino acid sequences 
were converted to all of the possible codons for each of the amino acids. Due to the 
degenerate nature of the genetic code, a given amino acid may be represented by several 

15 different codons. Where more than one base can be present in a codon for a given 
amino acid, the sequence is said to be degenerate. 

The primers were then synthesized as a pool of all of tlie possible DNA 
sequences that could code for a given amino acid sequence. The amount of degeneracy 
of a given primer pool can be determined by multiplying the number of possible 

20 nucleotides at each position. 

The more degenerate a primer pool, (i.e., the greater the number of individual 
unique primer DNA sequences within the pool), the greater the probability that one of 
the unique primer sequences will bind to regions of the target chromoso: »1 DNA other 
than the one desired - hence, the lesser the specificity of the resulting amplification. 

25 To increase the specificity of the amplification using the degenerate primers, the pools 
are synthesized as subsets such that the entire group of subsets includes all possible 
DNA sequences encoding the given amino acid sequence, but each individual subset 
only includes a pprtion: for example, one pool may contain either a G or C at a 
particular position while the other contains either an A or T at the same position. Each 

30 of these subpools is designated with a DG number. 

Both forward primers (directed from the 5* region toward the y region of the 
gene, complementary to the noncoding su^and) and reverse primers (directed from the 
3' region toward the 5' region of the gene, complementary to the coding strand) were 
designed for most of these conserved regions to clone Tma polymerase. The primers 

35 were designed with restriction sites at the 5' ends to facilitate cloning. The forward 
primers contained a Bglll restriction .site ( AGATCT), while die reverse primers 
contained an EcoRI restriction site (GAATTC). In addition, the primers contained 2 
nucleotides at the 5* end to increase the efficiency of cuning at the restriction site. 
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Degenerate primers were then used in PGR processes in which the target nucleic 
acid was chromosomal DNA from Thermotoga maritima. The products of the PGR 
processes using a combination of forward and reverse primer pools in conjuncuon wiUi 
a series of temperature profiles were compared. When specific products of similar size 
5 to the product generated using laa chromosomal DNA were produced, the PGR 

fragments were gel purified, rcampUfied and cloned into the vector BSM13H3:Bgin (a 
derivative of the Stratagcne vector pBSM+ in which the HMni site of pBSM+ was 
converted to a Bglll siic). Sequences were identified as potential thermostable DNA 
polymerase coding sequences if the sequences were found to encode amino acid 
10 sequences homologous to other known amino acid sequences in polymerase proteins, 
particularly those of laa polymerase and Iih polymerase. 

The portions of the Tom DNA polymerase gene were then identified in the 
chromosomal DNA of Thermotoga maritima by Southern blot analysis. The Ima 
chromosomal DNA was digested with a variety of enzymes and transferred to 
15 nitrocellulose filters. Probes labeled with 32? or biotin-dUTP were generated for 
various regions of the gene from the cloned PGR products. The probes were 
hybridized to the nitrocellulose-bound genomic DNA, allowing identification of the size 
of the chromosomal DNA fragment hybridizing to the probe. The use of probes 
.covering the 5* and 3' regions of the gene ensures that the DNA fragment(s) contain 
20 most if not all of the structural gene for the polymerase. Restriction enzymes are 
identified dial can be used to produce fragments that contain the structural gene in a 
single DNA fragment or in several DNA fragments to facilitate cloning. 

Once identified, the chromosomal DNA fragments encoding die Ima DNA 
polymerase gene were cloned. Ghromosomal DNA was digested with the identified 
25 restriction enzyme and size fractionated. Fractions containing the desired size range 
were concentrated, desalted, and cloned into die BSM13H3:BglII cloning vector. 
Glones were identified by hybridization using labeled probes generated from the 
previous cloned PGR products. The PGR products were then analyzed on 
polyacrylamide gels. 

30 The DNA sequence and amino acid sequence shown above and the DNA 

compounds that encode those sequences can be used to design and constnict 
recombinant DNA expression vectors to drive expression of Tma DNA polymerase 
activity in a wide variety of host cells. A DNA compound encoding all or part of die 
DNA sequence shown above can also be used as a probe to identify thermostable 

35 polymerase-encoding DNA from other organisms, and die amino acid sequence shown 
above can be used to design peptides for use as immunogens to prepare antibodies that 
can be used to identify and purify a theraiostable polymerase. 

Whedier produced by recombinant vectors diat encode the above amino acid 
sequence or by native Themiotoga maritima cells, however, Ima DNA polymerase will 
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typically be purified prior to use in a rccombinani DNA technique. The present 
invention provides such purification methodology. 

For recovering the native protein, the cells are grown using any suitable 
technique. Briefly, the cells are grown in "MMS "-medium containing (per liter): NaCl 
5 (6.Q3 g); MgS04-7H20 (1.75 g); MgClr6H20 (1,38 g); KCl (0,16 g); NaBr (25 mg); 
HsBOs (7.5 mg); SrCl2.6H20 (3.8 mg); KI (0.025 mg); CaQi (0.38 g); KH2PO4 
(0.5 g); Na2S (0.5 g); (NH4)2Ni(S04)2 (2 mg); trace minerals (Balch £i al... 1979, 
Microbiol . Rev , 41:260-296) (15 ml); resazurin (1 mg); and starch (5 g) at a pH of 6.5 
(adjusted with H2SO4). For growth on solid medium, 0.8% agar (Oxoid) may be 

10 added to the medium. Reasonable growth of the cells also occurs in "SME"-medium 
(Stetier fiifll., 1983, Svst . Appl. Microbiol . 4:535-551) supplemented with 0.5% yeast 
extract, or in marine broth (Difco 2216). 

After cell growth, the isolation and purification of the enzyme takes place in six 
stages, each of which is carried out at a temperature below room temperature, 

15 preferably about O'C to about 4'C, unless stated otherwise. In the first stage or step, 
the cells, if frozen, are thawed, lysed in an Aminco french pressure cell (8-20.000 psi), 
suspended in a buffer at about pH 7,5, and sonicated 10 reduce viscosity. 

In the second stage, ammonium sulfate is added to the lysate to prevent the Tma 
DNA polymerase from binding to DNA or other cell lysate proteins. Also in the second 

20 stage, Polymin P (polyethyleneiminc, PEI) is added to the lysate to precipitate nucleic 
acids, and the lysate is centrifugcd. 

In the third step, ammonium sulfate is added to the supernatant, and the 
supernatant is loaded onto a phenyl sepharose column equilibrated with a buffer 
composed of TE (50 mM Tris-Cl. pH 7,5, and 1 mM EDTA) containing 0,3 M 

25 ammonium sulfate and 0.5 mM DTT (diihiothreitol). The column is then washed first 
with the same buffer, second with TE-DTT (without ammonium sulfate), third with 
ethylene glycol-TE-DTT, and finally with 2 M urea in TE-DTT containing ethylene 
glycol. Unless the capacity of the phenylsepharose is exceeded (i.e. by loading more 
than -20-30 mg of protein per ml of resin) all of the Tma polymerase activity is retained 

30 by the column and elutes with the 2 M urea in TE-DTT containing ethylene glycol. 

In the fourth stage, the urea eluaie is applied to a heparin sepharose column 
which is equilibrated with 0.08 M KCl, 50 mM Tris-Cl (pH 7.5), 0. 1 mM EDTA. 
0.2% Tween 20 and 0.5 mM DTT. The column is then washed in the same buffer and 
the enzyme eluted with a linear gradient of 0.08 M to 0.5 M KCl buffer. The peak 

35 activity fractions were found at 0,225 M to 0.275 M KCl. 
_ . , . In the fifth stage, the fraction collected in the fourth stage is diluted with affigel- 
blue buffer without KCl and applied to an affigel-blue column equilibrated in 25 mM 
Tris-Cl (pH 7.5), 0.1 mM EDTA, 0,2% Tween 20, 0.5 mM DTT, and 0.15 M KCl. 
The column is washed v^th the same buffer and eluted with a linear gradient of 0.15 M 
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to 0.7 M KCl in the same buffer. The peak activity fractions were found at the 0.3 M 
to 0.55 M KCl section of the gradient. These fractions of peak activity are then tested 
- for contaminating deoxyribonucleases (endonucleases and exonucleases) using any 
suiuble procedure. As an example, endonuclease activity may be determined 
5 elcctrophoretically from the change in molecular weight of phage X DNA or supercoiled 
plasmid DNA after incubation witii an excess of DNA polymerase. Similarly, 
exonuclease activity may be determined elcctrophoretically from the change in 
molecular weight of restriction enzyme digested DNA after incubation with an excess of 
DNA polymerase. The fractions tiiat have no deoxyribonuclease activity are pooled and 
10 diafiltered into phosphocellulose buffer containing 50 mM KCl. 

Finally, in a sixth stage, Uie diafiltered pool from stage five is loaded onto a 
phosphocellulose column equilibrated to the correct pH and ionic strength of 25 mfvl 
Tris-Cl (pH 7.5), 50 mM KCl. 0.1 mM EDTA, 0.2% Tween 20, and 0.5 mM DTT. 
The column is tiien washed with the same buffer and eluted witii a Unear 0.05 M to 0.5 
15 M KCl gradient. The peak fractions eluted between 0.215 M and 0.31 M KCl. An 
undegradcd, purified DNA polymerase from these fractions is evidenced by an 
unchanged migration pattern in an in silU activity gel. 

The molecular weight of the DNA polymerase purified from Thermotog^ 
maritima may be deteraiined by any technique, for example, by SDS-PAGE analysis 
20 using protein molecular weight markers or by calculation from die coding sequence. 
The molecular weight of the DNA polymerase purified from Thermotoga mantilM is 
determined by SDS-PAGE to be about 97 kDa. Based on die predicted amino acid 
sequence, the molecular weight is estimated at about 102 kDa. The purification 
protocol of native Ima DNA polymerase is described in more detail in Example 1 . 
25 Purification of die recombinant Tma polymerase of the invention can be earned out with 

similar methodology. 

Biologically active recombinant Ima polymerases of various molecular weights 
can be prepared by die methods and vectors of the present invention. Even when die 
complete coding sequence of the Iroa DNA polymerase gene is present in an 

30 expression vector in E. coH. the cells produce a truncated polymerase, formed by 
translation starting with die methionine codon at position 140. One can also use 
recombinant means to produce a truncated polymerase corresponding to the protein 
produced by initiating translation at die methionine codon at position 284 of die Ima 
coding sequence. The polymerase lacking amino acids 1 diough 139 (about 86 kDa), 

35 and die polymerase lacking amino acids 1 dirough 283 (about 70 kDa) of die wUd type 
Tma polymerase retain polymerase activity but have attenuated 5'->3' exonuclease 
activity. In addition, the 70 kDa polymerase is significandy more diermostable dian 
native Tma polymerase. 
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Thus, the entire sequence of the intact Tma DNA polymerase I enzyme is not 
required for activity. Portions of the Tma DNA polymerase I coding sequence can be 
used in recombinant DNA techniques to produce a biologically active gene product with 
DNA polymerase activity. The availability of DNA encoding the Tma DNA polymerase 

J sequence provides the opportunity to modify the coding sequence so as to generate 
mutein (mutant protein) fomis also having DNA polymerase activity. The amino(N)- 
lerminal portion of the Tma polymerase is not necessary for polymerase activity but 
rather encodes the 5*->3' exonuclease activity of the protein. Using recombinant DNA 
methodology, one can delete approximately up to one-third of the N-terminal coding 

10 sequence of the Tma gene, clone, and express a gene product that is quite active in 
polymerase assays but, depending on the extent of the deletion, has no 5*->3* 
exonuclease activity. Because certain N-terminai shortened forms of the polymerase 
are active, the gene constructs used for expression of these polymerases can include the 
corresponding shortened forms of the coding sequence. 

15 In addition to the N-ierminal deletions, individual amino acid residues in the 

peptide chain of Tma polymerase may be modified by oxidation, reduction, or other 
derivation, and the protein may be cleaved to obtain fragments that retain activity. Such 
alterations that do not destroy activity do not remove the protein from the definition of a 
protein with Tma polymerase activity and so are specifically included within the scope 

20 of the present invention. 

Modifications to the primary structure of the Tma DNA polymerase coding 
sequence by deletion, addition, or alteration so as to change the amino acids 
incorporated into the Tma DNA polymerase during translation of the mRNA produced 
from that coding sequence can be made witiiout destroying die high temperature DNA 

25 polymerase activity of the protein. Such substitutions or other alterations result in the 
production of proteins having an amino acid sequence encoded by DNA falling within 
the contemplated scope of the present invention. Likewise, the cloned genomic 
sequence, or homologous synthetic sequences, of the Trna DNA polymerase gene can 
be used to express a fusion polypeptide with Tma DNA polymerase activity or to 

30 express a proiein wiiii an arnino acid sequence identical to that of native Tma DNA 

polymerase. In addition, such expression can be directed by die Tma DNA polymerase 
gene control sequences or by a control sequence that functions in whatever host is 
chosen to express the Tma DNA polymerase. 

Thus, the present invention provides a coding sequence for Tma DNA 

35, polymerase from which expression vectors applicable to a variety of host systems can 
be constructed and the coding sequence expressed. -Portions of the Tma polymerase- 
encoding sequence are also useful as probes to retrieve other thermostable polymerase- 
encoding sequences in a variety of species. Accordingly, oligonucleotide probes that 
encode at least four to six amino acids can be synthesized and used to retrieve additional 
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DNAs encoding a thermostable polymerase. Because there may not be an exact match 
between the nucleotide sequence of the thermostable DNA polymerase gene of 
Thermotoga maritima and the corresponding gene of other species, oligomers 
containing approximately 12-18 nucleotides (encoding the four to sbc amino sequence) 
5 are usually necessary to obtain hybridization under conditions of sufficient stringency 
to eliminate false positives. Sequences encoding six amino acids supply ample 
information for such probes. Such oligonucleotide probes can be used as primers in 
the degenerate priming method of the invention to obtain themiostable polymerase 
encoding sequences. 

10 The present invention, by providing coding sequences and amino acid 

sequences for Tma DNA polymerase, tiierefore enables the isolation of other 
thermostable polymerase enzymes and tiie coding sequences for those enzymes. The 
amino acid sequence of the Tma DNA polymerase I protein is very similar to the amino 
acid sequences for the tiiermostable DNA polymerases of Tag and Tth. These 
15 similarities facilitated the identification and isolation of the Tma DNA polymerase 
coding sequence. The areas of similarity in tiie coding sequences of Uiese three 
tiiennostable DNA polymerases can be readily observed by aligning tiie sequences. 

However, regions of dissimilarity between the coding sequences of tiie tiiree 
thermostable DNA polymerases can also be used as probes to identify other 
20 thermostable polymerase coding sequences that encode thermostable polymerase 
enzymes. For example, the coding sequence for a thermostable polymerase having 
some properties of laq and other divergent properties of Ima may be identified by 
using probes directed to sequences tiiat encode the regions of dissimilarity between laa 
and Tma . Specifically, such regions include a stretch of four or more contiguous 
25 amino acids from any one or more of the following regions, identified by amino acid 
sequence coordinates (numbering is inclusive): 5-10. 73-79, 113-1 19, 134-145, 
19 M96, 328-340, 348-352, 382-387, 405-414, 467-470, 495-499, 506-512, 
555-559, 579-584, 595-599, 650-655, 732-742, 820-825, 850-856, These regions 
may be considered as "hallmark motifs" and define additional regions of critical amino 
30 acid signature sequences for tiiermostable DNA polymerase functions (e.g. 5'— >3* 
exonuclease activity, 3'^5' exonuclease activity, and DNA polymerase activity). 

One property found in the Tma DNA polymerase, but lacking in native Tag 
DNA polymerase and native Tth DNA polymerase, is 3'->5' exonuclease activity. 
This 3'->5* exonuclease activity is generally considered to be desirable, because 
35 misincorporated or unmatched bases of the syntiiesized nucleic acid sequence are 

eliminated by this activity. Therefore, the fidelity of PGR utilizing a polymerase with 
3*-^5' exonuclease activity (e.g. Tma DNA polymerase) is increased. The 3'-»5' 
exonuclease activity found in Tma DNA polymerase also decreases the probability of 
the formation of primer/dimer complexes in PGR. The 3'->5* exonuclease activity in 



wo 92/03556 



PCT/US91/05753 



15 

effect prevents any extra dNTPs from attaching to the 3* end of the primer in a non- 
template dependent fashion by removing any nucleotide that is attached in a non- 
template dependent fashion. The 3'->5' exonucleaise activity can eliminate single- 
stranded DNAs, such as primers or single-stranded template. In essence, every 3 - 
5 nucleotide of a single-stranded primer or template is treated by the enzyme as 

unmatched and is therefore degraded. To avoid primer degradation in PGR, one can 
add phosphorothioate to the 3* ends of the primers. Phosphoroihioate modified 
nucleotides are more resistant to removal by 3*— >5* exonucleases. 

A "motif or characteristic "signature sequence" of amino acids critical for 
10 3'-->5' exonuclease activity in thermostable DNA polymerases can be defined as 

comprising three shbn domains. Below, these domains are identified as A, B, and C. 
with critical amino acid residues shown in one lencr abbreviation and non-critical 
residues identified as "x." 

Representative 

15 Domain Sequence Trra Coordinates 

A DxExxxL 323-329 

B NxxxDxxxL 385-393 

C YxxxD 464-468 

The distance between region A and region B is 55-65 amino acids. The distance 

20 berween region B and region C is 67-75 amino acids, preferably about 70 amino acids. 
In Tma DNA polymerase, the amino acids that do not define the critical motif signature 
sequence amino acids are L and TSS, respectively, in domain A; LKF and YKV, 
respectively, in domain B; and SCE in domain C. Domain A is therefore DLET5SL; 
domain B is NLKFDYKVL; and domain C is YSCED in TYna DNA polymerase L 

25 Thus, the present invention provides a thermostable DNA polymerase possessing 
3'-^5' exonuclease activity that comprises domains A, B, and C, and, more 
panicularly comprises the sequence D-X-E-X3-L-X55-65-N-X3-D-X3-L-X65-75- Y-X3- 
D, where one letter amino acid abbreviation is used, and X^ represents the number (N) 
of non-critical amino acids between the specified amino acids. 

30 A thermostable 3*— >5* exonuclease domain is represented by an^ino acids 291 

through 484 of Tma DNA polymerase. Accordingly, "domain shuffling" or 
construction of "thermostable chimeric DNA polymerases" may be used to provide 
thermostable DNA polymerases containing novel properties. For example, substitution 
of the Tma DNA polymerase coding sequence comprising codons about 291 through 

35 about 484 for the Thermu?? aquaticus DNA polymerase I codons 289-422 would yield a 
novel thermostable DNA polymerase containing the 5'->3* exonuclease 
domain of Tgg DNA polymerase (1-289). the 3*->5' exonuclease domain of Tma DNA 
polymerase (291-484), and the DNA polymerase domain of Tag DNA polymerase 
(4: 832). Alternatively, the 5'^3* exonuclease domain and the 3'->5' exonuclease 
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domain of Ima DNA polymerase (ca. codons 1-484) may be fused to tlie DNA 
polymerase (dNTP binding and primer/template binding domains) portions of Taa 
DNA polymerase (ca. codons 423-832). The donors and recipients need not be limited 
to laa and Ima DNA polymerases. Ilh DNA polymerase provides analogous domains 
5 as I&3 DNA polymerase. In addition, the enhanced/preferred reverse transcriptase 
properties of Tth DNA polymerase can be further enhanced by the addition of a 3'-^5' 
exonuclease domain as illustrated above. 

While any of a variety of means may be used to generatfe chimeric DNA 
polymerase coding sequences (possessing novel properties), a preferred method 
10 employs "overlap" PGR. In this method, the intended junction sequence is designed 
into the PGR primers (at tiieir 5'-ends). Following the initial amplification of the 
individual domains, the various products are diluted (ca. 100 to 1000-fold) and 
combined, denatured, annealed, extended, and then the final forward and reverse 
primers are added for an otherwise standard PGR. 
15 Thus, the sequence that codes for the 3'->5' exonuclease activity of Tma DNA 

polymerase can be removed from Tma DNA polymerase or added to other polymerases 
tiiat lack tins activity by recombinant DNA methodology. One can even replace, in a 
non-thermostable DNA polymerase, the 3'^5' exonuclease activity domain with the 
thermostable 3'->5' exonuclease domain of Tma polymerase. Likewise, the 3'->5' 
20 exonuclease activity domain of a non-thermostable DNA polymerase can be used to 
replace tiie 3'->5' exonuclease domain of Tnm polymerase (or any other thermostable 
polymerase) to create a useful polymerase of the invention. Those of skill in die art 
recognize that die above chimeric polymerases are most easily constructed by 
recombinant DNA techniques. Similar chimeric polymerases can be constructed by 
25 moving the 5'->3' exonuclease domain of one DNA polymerase to another. 

Whether one desires to produce an enzyme identical to native Tma DNA 
polymerase or a derivative or homologue of that enzyme, the production of a 
recombinant form of Tma polymerase typically involves the construction of an 
expression vector, the transformation of a host cell with the vector, and culture of die 
30 transformed host cell under conditions such diat expression will occur. 

To construct the expression vector, a DNA is obtained that encodes the mature 
(used here to include all chimeras or muteins) enzyme or a fusion of die Tma 
polymerase to an additional sequence that does not destroy activity or to an additional 
sequence cleavable under controlled conditions (such as treatment with peptidase) to 
35 give an active protein. The coding sequence is then placed in operable linkage with 
suitable control sequences in an expression vector. The vector can be designed to 
replicate autonomously in the host cell or to integrate into Uie chromosomal DNA of tl 
host cell. The vector is used to transform a suitable host, and the transformed host is 
cultured under conditions suitable for expression of recombinant Ima polymerase. T 
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Tma polymerase is isolated from the medium or from the cells, although recovery and 
purification of the protein may not be necessary in some instances. 

Each of the foregoing steps can be done in a variety of ways. For example, the 
desired coding sequence may be obtained from genomic fragments and used directly in 
5 appropriate hosts. The construction for expression vectors operable in a variety of 
hosts is made using appropriate replicons and control sequences, as set forth generally 
below. Construction of suitable vectors containing the desired coding and control 
sequences employs standard ligation and restriction techniques that are well understood 
in the art. Isolated plasmids, DNA sequences, or synthesized oligonucleotides are 

10 cleaved, modified, and religated in the form desired. Suitable restriction sites can, if 
not normally available, be added to the ends of the coding sequence so as to facilitate 
construction of an expression vector, as exemplified below. 

Site-specific DNA cleavage is performed by u-eating with suitable restriction 
enzyme (or enzymes) under conditions that are generally understood in the art and 

15 specified by the manufacturers of commercially available restriction enzymes. See, 
e.g.. New England Biolabs, Product Catalog. In general, about 1 jig of plasmid or 
other DNA is cleaved by one unit of enzyme in about 20 |il of buffer solution; in the 
examples below, an excess of restriction enzyme is generally used to ensure complete 
digestion of the DNA. Incubation times of about one to two hours at about 37'C are 

20 typical, although valuations can be tolerated. After each incubation, protein is removed 
by extraction with phenol and chloroform; this extraction can be followed by ether 
extraction and recovery of the DNA from aqueous fractions by precipitation with 
ethanol. If desired, size separation of the cleaved fragments may be performed by 
polyacrylamide gel or agarose gel electrophoresis using standard techniques. See, e.g., 

25 Methods in Enzvmologv . 1980. 65:499-560. 

Restriction-cleaved fragments with single-strand "overhanging" termini can be 
made blunt-ended (double-strand ends) by treating with the large fragment of E. coH 
DNA polymerase I (Klenow) in the presence of the four deoxynucleoside triphosphates 
(dNTPs) using incubation times of about 15 to 25 minutes at 20'C to 25'C in 50 nuM 

30 Tris pH 7.6, 50 mM NaCl, 10 mM MgCl2, 10 rrM DTT, and 5 to 10 "iM dNTPs. The 
Klenow fragment fills in at 5' protruding ends, but chews back protruding 3* single 
strands, even though the four dNTPs are present. If desired, selective repair can be 
performed by supplying only one of the, or selected, dNTPs within the limitations 
dictated by the nature of the protruding ends. After treatment with Klenow, the mixture 

35 is extracted with phenol/chloroform and ethanol precipitated. Similar results can be 
achieved using S 1 nuclease, because treatment under appropriate conditions with S 1 
nuclease results in hydrolysis of any single-stranded portion of a nucleic acid. 

Synthetic oligonucleotides can be prepared using the triester method of 
Matteucci et al-, 1981, J. Am. Chem. Soc. 103:3185-3191, or automated synthesis 
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methods. Kinasing of single strands prior to annealing or for labeUng is achieved using 
an excess, e.g., approximately 10 units, of polynucleotide kinase to 0.5 \iM substrate 
in the presence of 50 mM Tris. pH 7.6. 10 mM MgCli. 5 mM diihiothreitol (DTT). and 
1 to 2 ATP. If kinasing is for labeUng of probe, the ATP will contain high specific 

5 activity y-^^?- 

Ligations are performed in 1 5-30 ^1 volumes imder the foUowing standard 

conditions and temperatures: 20 mM Tris-Cl. pH 7.5. 10 mM MgClz. 10 mM DTT. 33 

^ig/ml BSA. 10 niM-50 mM NaCl. and either 40 \iM ATP and 0.01-0.02 (Weiss) units 

T4 DNA ligase at 0*C (for ligation of fragments with complementary single-stranded 

10 ends) or 1 mM ATP and 0.3-0.6 units T4 DNA ligase at M'C (for "blunt end" 

Ugation). Intermolecular ligations of fragments with complementary ends are usually 

performed at 33-100 ng/ml total DNA concentrations (5 to 100 nM total ends 

concentration). Intemiolecular blunt end ligations (usually employing a 20 to 30 fold 

molar excess of linkers, optionally) are performed at 1 nM total ends concentration. 

1 5 In vector consffuction, the vector fragment is commonly treated with bacterial or 

calf intestinal alkaline phosphatase (BAP or CIAP) to remove the phosphate and 

prevent religation and reconstruction of the vector. BAP and CIAP digestion 

conditions are well known in the art. and published protocols usually accompany the 

commercially available BAP and CIAP enzymes. To recover the nucleic acid 

20 fragments, the preparation is extracted with phenol-chloroform and ethanol precipitated 

to remove AP and purify the DNA. Alternatively, religation can be prevented by 

restriction enzyme digestion of unwanted vector fragments before or after ligation with 

the desired vector. 

For portions of vectors or coding sequences that require sequence 

25 modifications, a variety of site-specific primer-directed mutagenesis metiiods are 

available. The polymerase chain reaction (PCR) can be used to perform site-specific 
mutagenesis. In another technique now standard in tiie an, a synthetic oligonucleotide 
encoding the desired mutation is used as a primer to direct synUiesis of a 
complementary nucleic acid sequence of a single-stranded vector, such as pBS13-^. tiiat 

30 serves as a template for construction of the extension product of tiie mutagenizing 
primer. The mutagenized DNA is transfomied into a host bacterium, and cultures of 
the transformed bacteria are plated and identified. The identification of modified 
vectors may involve transfer of the DNA of selected transformants to a nitrocellulose 
filter or other membrane and the "lifts" hybridized with kinased synOietic primer at a 

35 temperature that permits hybridization of an exact match to the modified sequence but 
prevents hybridization with the original strand. Transformants tiiat contain DNA Uiat 
hybridizes with the probe are tiien cultured and seive as a reservoir of tiie modified 
DNA. 
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In the constructions set forth below, correct ligations for plasnodd construction 
are confmned by first transforming L ££2li strain DGlOl or another suitable host with 
the ligation mixture. Successful transformants are selected by ampicillin, tetracycline or 
other antibiotic resistance or sensitivity or by using other markers, depending on the 

5 mode of plasmid construction, as is understood in the an. Plasmids from the 

transforaiants are then prepared according to the method of Clewell gt al., 1969, Proc . 
IM. Apaj. Sei. USA 1 1 59, optionally following chloramphenicol amplification 
(Clewell, 1972, 1. Bacteriol . 11Q:667). Another method for obtaining plasmid DNA is 
described as the "Base-Acid" extraction method at page 1 1 of the Bethesda Research 

10 Laboratories publication Focus , volume 5, number 2, and very pure plasmid DNA can 

be obtained by replacing steps 12 through 17 of the protocol with CsCl/ethidium 

bromide ultracentrifugation of the DNA. The isolated DNA is analyzed by restriction 

enzyme digestion and/or sequenced by the dideoxy method of Sanger si al- 1977, 

Ems. Nail- Agad . Sci. USA 74:5463. as further described by Messing £i ai.. 1981, 

15 Niic, Acids Ees. 2:309. or by the method of Maxam£ial., 1980. Methods in 
Enzvmology 65:499. 

The control sequences, expression vectors, and transformation methods are 
dependent on the type of host cell used to express the gene. Generally, prxKaryotic, 
yeast, insect, or mammalian cells are used as hosts. Procaryoric hosts are in general the 

20 most efficient and convenient for the production of recombinant proteins and are 
therefore preferred for the expression of Tma polymerase. 

The procaryoie most frequently used to express recombinant proteins is E. coli . 
For cloning and sequencing, and for expression of constructions under control of most 
bacterial promoters, E. coH K12 strain MM294, obtained from the E. coH Genetic 

25 Stock Center under GCSC ^^6135, can be used as the host. For expression vectors 
with the PlNrbs control sequence, E. coH K12 strain MClOOO lambda lysogen, 
N7N53CI857 SusPgo. ATCC 39531, may be used. E. ££ili DGl 16, which was 
deposited with the ATCC (ATCC 53606) on April 7. 1987, and E. cqH KB2, which 
was deposited with the ATCC (ATCC 53075) on March 29, 1985, are also useful host 

30 cells. For M 1 3 phage recombinants. coH strains susceptible to phage infection, such 
as E. coli K12 strain DG98, are employed. The DG98 strain was deposited with the 
ATCC (ATCC 39768) on July 13, 1984. 

However, microbial strains other than E. coH can also be used, such as bacilli, 
for example Paciligs SUlziiiis. various species of Pseudomonas . and other bacterial 

35 strains, for recombinant expression of Tma DNA polymerase. In such procaryotic 
systems, plasmid vectors that contain replication sites and control sequences derived 
from the host or a species compatible with the host are typically used. 

For example, E. coH is typically transformed using derivatives of pBR322, 
described by Bolivar ei al.. 1977, Gene 2:95. Plasmid pBR322 contains genes for 
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ampicillin and tetracycline resistance. These drug resistance markers can be either 
retained or destroyed in constructing the desired vector and so help to detect the 
presence of a desired recombinant. Commonly used procaryotic control sequences, 
i.e., a promoter for transcription initiation, optionally with an operator, along with a 
5 riboson;e binding site sequence, include the P-lactamase (peniciUinase) and lactose (lac) 
promoter systems (Chang etal-. 1977. Nature 198:1056). the tryptophan (trp) 
promoter system (Goeddel £i al- 1980, Nuc. Adds RfiS- fi:4057), and the lambda- 
derived Pl promoter (Shimatake sial-. 1981. Naium 222:128) and N-gene ribosome 
binding site (Nrbs). A portable control system cassette is set forth in United States 
10 Patent No. 4,711,845, issued December 8, 1987. This cassette comprises a Pl 
promoter operably linked to the Nrbs in turn positioned upstream of a third DNA 
sequence having at least one restriction site that permits cleavage witiun six bp 3' of the 
Nrbs sequence. Also useful is the phosphatase A (phoA) system described by Chang 
in European Patent Publication No. 196,864, published October 8, 1986. 
15 However, any available promoter system compatible wiih procaryotes can be used to 
construct a Tma expression vector of the invention. 

The nucleotide sequence of tiie Jm insert may negatively affect Uie efficiency 
of tiie upstream ribosomal binding site, resulting in low levels of translated polymerase. 
The translation of tiie Ima gene can be enhanced by tiie construction of "translationally 
20 coupled" derivatives of the expression vectors. An expression vector can be 

constmcted witii a secondary translation initiation signal and short coding sequence just 
upstream of die Ima gene coding sequence such that die stop codon for die short 
coding sequence is "coupled" witii tiie ATG start codon for tiie Ima gene coding 
sequence. A secondary translation initiation signal tiiat efficientiy initiates translation 
25 can be insened upstream of the Trna gene start codon. Translation of the shon coding 
sequence brings tiie ribosome into close proximity witii tiie Ima gene translation 
initiation site, thereby enhancing translation of the Ima gene. For example, one 
expression system can utilize tiie translation initiation signal and first ten codons of the 
T7 bacteriophage major capsid protein (gene 10) fused in-frame to the last sbt codons 
30 of TrpE. The TGA (stop) codon for TrpE is "coupled" witii the ATG (start) codon for 
the Ima gene coding sequence, forming the sequence TGATG. A one base Jrame-shift 
is required between translation of the short coding sequence and ffanslation of tiie Tma 
coding sequence. These derivative expression vectors can be constructed by 
recombinant DNA metiiods. 
35 The redundancy of the genetic code can also be related to a low translation 

efficiency. Typically, when multiple codons coding tiie same amino acid occur, one of 
the possible codons is preferentially used in an organism. Frequently, an organism 
accumulates tiie tRNA species corresponding to the preferred codons at a higher level 
than those corresponding to rarely used codons. If tiie pattern of codon usage differs 
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between Thennotoga maririma and the host cell, the tRNA species necessary for 
translation of the Tma polymerase gene may be in low abundance. In the Tma coding 
sequence, arginine is most frequently coded for by the "AGA" codon, whereas this 
codon is used at low frequency in £. qqH genes, and the corresponding tRNA is present 

5 in low concentration in £. £j2li host cells. Consequently, the low concentration in the 
E, coH host cell of "Arg U" tRNA for the "AGA" condon may limit the translation 
efficiency of the Tma polymerase gene RNA in £. cqH host cells. The efficiency of 
translation of the Tma coding sequence within an E. cqU host cell may be improved by 
increasing the concentration of this Arg iRNA species by expressing multiple copies of 

10 this tRNA gene in the host cell. 

In addition to bacteria, eucaryotic microbes, such as yeast, can also be used as 
recombinant host cells. Laboratory strains of Saccharomvccs ccrevisiae > Baker's yeast, 
are most often used, although a number of other sn*ains are commonly available. While 
vectors employing the two micron origin of replication are common (Broach, 1983, 

15 Meth . Enz. 101 :307). other plasmid vectors suitable for yeast expression are known 
(see, for example, Siinchcomb £i al.. 1979, Nature 282:39: Tschempe £i al., 1980, 
Q£Il£lQ:157; and Clarke fcial- 1983, Msitl. Enz. 101:300). Control sequences for 
yeast vectors include promoters for the synthesis of glycolytic enzymes (Hess £l 
1968, 1, Adv. Enzvme Reg. 2: 149; Holland £l fll-, 1978, Biotechnology 12:4900; and 

20 Holland £1 21m 1981. J. Biol . Chem . ? 56:1385). Additional promoters known in tiie 
art include the promoter for 3-phosphoglycerate kinase (Hitzeman ei al., 1980, 1. Biol . 
Chem . 222:2073) and those for other glycolytic enzymes, such as glyceraldehyde 3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofhictokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 

25 triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Other 
promoters that have Uie additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, 
acid phosphatase, degradative enzymes associated with nitrogen metabolism, and 
enzymes responsible for maltose and galactose utilization (Holland, supra ). 

30 Terminator sequences may also be used to enhance expression when placed at 

the 3' end of the coding sequence. Such terminators are found in the 3* untranslated 
region following the coding sequences in yeast-derived genes. Any vector containing a 
yeast-compatible promoter, origin of replication, and other control sequences is suitable 
for use in constructing yeast Tma expression vectors. 

35 The Tma gene can also be expressed in eucaryotic host cell cultures derived 

. from multicellular organisms. See, for example. Tissue Culture , Academic Press, Cruz 
and Patterson, editors (1973). Useful host cell lines include COS-7, C0S-A2, CV-1. 
murine cells such as murine myelomas N51 and VERO, HeLa cells, and Chinese 
hamster ovary (CHO) cells. Expression vectors for such cells ordinarily include 
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promoters and control sequences compauble with mammalian cells such as, for 
example, the commonly used early and late promoters from Simian Virus 40 (SV 40) 
(Fiers £l a!-. 1978, Nature 221:113), or other viral promoters such as those derived 
from polyoma, adenovirus 2. bovine papilloma virus (BPV), or avian sarcoma viruses, 
5 or immunoglobulin promoters and heat shock promoters. A system for expressing 
DNA in mammalian systems using a BPV vector system is disclosed in U.S. Patent 
No. 4,419,446. A modification of this system is described in U.S. Patent No. 
4,601 ,978. General aspects of mammalian cell host system transformations have been 
described by Axel. U.S. Patent No. 4.399.216. "Enhancer" regions are also important 
10 in optimizing expression; these are. generally, sequences found upstream of the 

promoter region. Origins of replication may be obtained, if needed, from viral sources. 
Hovi/ever. integration into die chromosome is a common mechanism for DNA 
replication in eucaryotes. 

Plant cells can also be used as hosts, and control sequences compatible with 
15 plant cells, such as the nopaline synthase promoter and polyadenylation signal 

sequences (Depickcr £i al-. 1982, i. MsA- hM- GfiH. 1:561) are available. Expression 
systems employing insect cells utilizing the control systems provided by baculovirus 
vectors have also been described (Miller si al-, 1986. Genetic Enging^ng (SeUow ei 
ai.. eds.. Plenum Publishing) £-.277-297). Insect cell-based expression can be 
20 accompUshed in Spodopiera fru^peida . These systems can also be used to produce 
recombinant Tma polymerase. 

Depending on the host cell used, transformation is done using standard 
techniques appropriate to such cells. The calcium treatment employing calcium 
chloride, as described '-y Cohen, 1972, Pro£. JM- Acad . Sd- USA 62:21 10 is used 
25 for procaiyotes or otiier cells that contain substantial cell wall barriers. Infection with 
Agrobacterium tumefaciens (Shaw £t al-. 1983, Gsoe 22:3 15) is used for certain plant 
cells. For mammalian ceUs. the calcium phosphate precipitation metiiod of Graham and 
van der Eb, 197 8, Virolo^ 52:546 is preferred. Transformations into yeast arc carried 
out according to the method of Van Solingen £i M-, 1977, l Eaci. 122:946 and Hsiao 
30 et al., 1979, Proc . Natl . Acad . Sci . USA 76:3829. 

Once the Tnia DNA polymerase has been expressed in a recombinant host ceU, 
purification of the protein may be desired. Although a variety of purification 
procedures can be used to purify the recombinant thermostable polymerase of the 
invention, fewer steps may be necessary to yield an enzyme preparation of equal purity. 
35 Because 1. £Qli host proteins are heat-sensitive, the recombinant thermostable Ima 
DNA polymerase can be substantially enriched by heat inactivating the crude lysate. 
This step is done in die presence of a sufficient amount of salt (typically 0.3 M 
ammonium sulfate) to ensure dissociation of the Tina DNA polymerase from die host 
DNA and to reduce ionic interactions of Tma DNA polymerase with other cell lysate 
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proteins. In addition, the presence of 0.3 M ammonium sulfate promotes hydrophobic 
interaction with a phenyl sepharose column. Hydrophobic interaction chromatography 
is a separation technique in which substances are separated on the basis of differing 
strengths of hydrophobic interaction with an uncharged bed material containing 
5 hydrophobic groups. Typically, the column is Tvrsi equilibrated under conditions 
favorable to hydrophobic binding, such as high ionic strength. A descending salt 
gradient may then be used to elute the sample. 

According to the invention, an aqueous mixture (containing either native or 
recombinant Tma DNA polymerase) is loaded onto a column containing a relatively 
10 strong hydrophobic gel such as phenyl sepharose (manufactu'-ed by Pharmacia) or 

Phenyl TSK (manufactured by Toyo Soda). To promote hyorophobic interaction with 
a phenyl sepharose column, a solvent is used that contains, for example, greater than or 
equal to 0.3 M ammonium sulfate, with 0.3 M being prefenned, or greater than or equal 
to 0.5 M NaCl. The column and the sample are adjusted to 0.3 M ammonium sulfate in 
15 50 m^4 Tris (pH 7.5) and 1.0 hlM EDTA C'TE") buffer that also contains 0.5 mM 
DTT, and tiie sample is applied to the column. The column is washed with the 0.3 M 
ammonium sulfate buffer. The enzyme may then be eluted with solvents that attenuate 
hydrophobic interactions, such as decreasing salt gradients, ethylene or propylene 
glycol, or urea. For native Tma DNA polymerase. . preferred embodiment involves 
20 washing the column with 2 M urea and 20% eUiyleric glycol in TE-DTT. 

For long-term stability, Tma DNA polymerase enzyme can be stored in a buffer 
that contains one or more non-ionic polymeric detergents. Such detergents are 
generally those that have a molecular weight in die range of approximately 100 to 
250,000 daltons, preferably about 4,000 to 200,000 daltons, and stabilize die enzyme 
25 at a pH of from about 3.5 to about 9.5, preferably from about 4 to 8.5. Examples of 
such detergents include those specified on pages 295-298 of McCutcheon's Emulsifiers 
& Detergents . North American edition (1983). published by the McCutcheon Division 
of MC Publishing Co., 175 Rock Road, Glen Rock, NJ (USA) and copending Serial 
No, 387.003, filed July 28, 1989, each of which is incorporated herein by rcfcfence. 
30 Preferably, the detergents are selected from die group 'comprising ethoxylated 

fatty alcohol ethers and lauryl ethers, ethoxylated alkyl pheno*.,, octylphenoxy 
polyethoxy ethanol compounds, modified oxyethylated and/or ox>T3ropylaied straight- 
chain alcohols, polyethylene glycol monooleate compounds, polysorbate compounds, 
and phenolic fatty alcohol ethers. More pardcularly preferred are Tween 20, a 
35 polyoxyethylated (20) sorbiian monolaurate from ICI Americas Inc., Wilmington. DE. 
and Iconol NP-40, an ethoxylated alkyl phenol Xnonyl) from BASF Wyandotte Corp. 
Parsippany, NJ. 

The thermostable enzyme of this invention may be used for any purpose in 
which such enzyme activity is necessary or desired. In a particularly preferred 
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embodiment, the enzyme catalyzes the nucleic acid amplification reaction known as 
PCR. This process for amplifying nucleic acid sequences is disclosed and claimed in 
U.S. Patent Nos. 4,683,202 and 4.865,188, each of which is incorporated herein by 
reference. The PCR nucleic acid amplification method involves amplifying at least one 
5 specific nucleic acid sequence contained in a nucleic acid or a mixture of nucleic acids 
and in the most common embodiment, produces double-stranded DNA. 

For ease of discussion, the protocol set forth below assumes that the specific 
sequence to be amplified is contained in a double-suranded nucleic acid. However, the 
process is equally useful in amplifying single-stranded nucleic acid, such as mRNA, 
10 although in the preferred embodiment the ultimate product is still double-stranded 

DNA. In the amplification of a single-stranded nucleic acid, the first step involves the 
synthesis of a complementary strand (one of the two amplification primers can be used 
for this purpose), and the succeeding steps proceed as in the double-stranded 
amplification process described below. 
15 This amplification process comprises the steps of: 

(a) contacting each nucleic acid strand with four different nucleoside 
uiphosphates and two oligonucleotide primers for each specific sequence being 
amplified, wherein each primer is selected to be substantially complementary to die 
different strands of the specific sequ';.ice, such that the extension product synthesized 

20 from one primer, when separated from its complement, can serve as a template for 
synUicsis of the extension product of the other primer, said contacting being at a 
temperature that allows hybridization of each primer to a complementary nucleic acid 
strand; 

(b) contacting each nucleic acid strand, at the same time as or after step (a), with 
25 a FjNA polymerase from Thennotoga maritima that enables combination of the 

nucleoside triphosphates to form primer extension products complementary to each 
strand of the specific nucleic acid sequence; 

(c) maintaining the mixture from step (b) at an effective temperamre for an 
effective time to promote tlie activity of ilie enzyme and to synthesize, for each different 

30 sequence being amplified, an extension product of each primer that is complementary to 
each nucleic acid strand template, but not so high as to separate each extension product 
from the complementary sffand template; 

(d) heating the mixture from step (c) for an effective time and at an effective 
temperature to separate the primer extension products from the templates on which Uiey 

35 were synthesized to produce single-stranded molecules but not so high as to denature 

irreversibly die enzyme; 

(e) cooling the mixture from step (d) for an effective time and to an effective 
temperature to promote hybridization of a primer to each of the single-stranded 
molecules produced in step (d); and 
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(0 maintaining the mixture from step (e) at an effective temperature for an 
effective time to promote the activity of the enzyme and to synthesize, for each different 
sequ'^nce being amplified, an extension product of each primer that is complementary to 
eac.t .lucleic acid template produced in step (d) but not so high as to separate each 
5 extension product from the complementary strand template. The effective times and 
temperatures in steps (e) and (0 may coincide, so that steps (e) and (f) can be carried 
out simultaneously. Steps (d)-(O are repeated until the desired level of amplification is 
obtained. 

The amplification method is useful not only for producing large amounts of a 

10 specific nucleic acid sequence of known sequence but also for producing nucleic acid 
sequences that are known to exist but are not completely specified. One need know 
only a sufficient number of bases at botii ends of the sequence in sufficient detail so that 
two oligonucleotide primers can be prepared thai will hybridize to different strands of 
the desired sequence at relative positions along the sequence such that an extension 

15 product synthesized from one primer, when separated from ti;e template (complement), 
can serve as a template for extension of the jther primer into a nucleic acid sequence of 
defined length. The greater the knowledge about the bases at both ends of the 
sequence, the greater can be the specificity of the primers for the target nucleic acid 
sequence and the efficiency of the process, 

20 In any case, an initial copy of the sequence to be amplified must be available, 

although the sequence need not be pure or a discrete molecule. In general, the 
amplification process involves a chain reaction for producing, in exponential quantities 
relative to the number of reaction steps involved, at least one specific nucleic acid 
sequence given that (a) tiie ends of the required sequence are known in sufficient detail 

25 that oligonucleotides can be synthesized that will hybridize to tiiem and (b) that a small 
amount of the sequence is available to initiate the chain reaction. The product of the 
chain reaction will be a discrete nucleic acid duplex with termini corresponding to the 5* 
ends of the specific primers employed. 

Any nucleic acid sequence, in purified or nonpurified form, can be utilized as 

30 the starting nucleic acid(s), provided it contains or is suspected to contain the specific 
nucleic acid sequence one desires to amplify. The nucleic acid to be amplified can be 
obtained from any source, for example, from plasmids such as pBR322, from cloned 
DNA or RNA, or from natural DNA or RNA from any source, including bacteria, 
yeast, viruses, organelles, and higher organisms such as plants and animals, DNA or 

35 RNA may be extracted from blood, tissue material such as chorionic villi, or amniotic 
cells by a variety of techniques. See, e.g., Maniaus et al., supra , pp. 280-281. Thus, 
the process may employ, for example, DNA or RNA, including messenger RNA, 
which DNA or RNA may be single-stranded or double-stranded. In addition, a DNA- 
RNA hybrid that contains one su^ind of each may be utilized. A mixture of any of these 
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nucleic acids can also be employed as can nucleic acids produced firom a previous 
amplificadon reaction (using the same or different primers). The specific nucleic acid 
sequence to be amplified can be only a fraction of a large molecule or can be present 
initially as a discrete molecule, so that the specific sequence constitutes the entire 
5 nucleic acid. 

The sequence to be amplified need not be present initially in a pure form; die 
sequence can be a minor fraction of a complex mixture, such as a portion of the (3- 
giobin gene contained in whole human DNA (as exemplified in Saiki fiial- 1985, 
Science 230 :1530-1534) or a portion of a nucleic acid sequence due to a particular 
10 microorganism, which organism might constitute only a very minor fraction of a 

particular biological sample. The cells can be directiy used in die amplification process 
after suspension in hypotonic buffer and heat treatment at about 90 C-100 C until ceU 
lysis and dispersion of intracellular components occur (generally 1 to 15 minutes). 
After the heating step, the amplification reagents may be added directly to the lysed 
1 5 cells. The starting nucleic acid sequence can contain more than one desired specific 

nucleic acid sequence. The amplification process is useful not only for producing large 
amounts of one specific nucleic acid sequence but also for amplifying simultaneously 
more than one different specific nucleic acid sequence located on the sanne or different 
nucleic acid molecules. 
20 Primers play a key role in the PGR process. The word "primer" as used in 

describing the amplification process can refer to more than one primer, particularly in 
the case where there is some ambiguity in Uie information regarding the terminal 
sequence(s) of the fragment to be amplified or where one employs the degenerate 
primer process of the invention. For instance, in the case where a nucleic acid 
25 sequence is inferred fi-om protein sequence information, a collection of primers 

containing sequences representing all possible codon variations based on degeneracy of 
the genetic code will be used for each su^d. One primer from this collection will be 
sufficienUy homologous with the end of the desired sequence to be amplified to be . 
useful for amplification. 
30 In addition, more than one specific nucleic acid sequence can be ampUfied from, 

the fu^t nucleic acid or mixture of nucleic acids, so long as the appropriate number of 
different oligonucleotide primers are utilized. For example, if two different specific 
nucleic acid sequences are to be produced, four primers are utiUzed. Two of the 
primers are specific for one of the specific nucleic acid sequences, and the other two 
35 primers are specific /or the second specific nucleic acid sequence. In this manner, each 
of tiie two different specific sequences can be produced exponentially by die present 
process. 

A sequence within a given sequence can be amplified after a given number of 
amplification cycles to obtain greater specificity in die reaction by adding, after at least 
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one cycle of amplification, a set of primers that are complementary to internal sequences 
(i.e., sequences that are not on the ends) of the sequence to be amplified. Such primers 
can be added at any stage and will provide a shorter amplified fragment Alternatively, 
a longer fragment can be prepared by using primers with non-complementary ends but 

5 having some overlap with the primers previously utilized in the amplification. 

Primers also play a key role when the amplification process is used for in vitro 
mutagenesis. The product of an amplification reaction where the primers employed are 
not exacUy complementary to the original template will contain the sequence of the 
primer rather than the template, so introducing an in vitro mutation. In further cycles, 

10 this mutation will be amplified with an undiminished efficiency because no further 
raispaired priming is required. The process of making an altered DNA sequence as 
described above could be repeated on the altered DNA using different primers to induce 
further sequence changes. In this way, a series of mutated sequences can gradually be 
produced wherein each new addition to the series differs from Uie last in a minor way, 

15 but from the original DNA source sequence in an increasingly major way, 

Bec?Mse the primer can contain as part of its sequence a non-complementary 
sequence, provided that a sufficient amount of the primer contains a sequence that is 
complementary to the strand to be amplified, many other advantages can be realized. 
For example, a nucleotide sequence that is not complementary to the template sequence 

20 (such as, e.g., a promoter, linker, coding sequence, etc.) may be attached at the 5* end 
of one or both of the primers and so appended to the product of the amplification 
process. After the extension primer is added, sufficient cycles are run to achieve the 
desired amount of new template containing the non-complementary nucleotide insert. 
This allows proiuction of large quantities of the combined fragments in a relatively 

25 short period of time (e.g., two hours or less) using a simple technique. 

Oligonucleotide primers can be prepared using any suitable method, such as, 
for example, the phosphotriesier and phosphodiester methods described above, or 
automated embodiments thereof. In one such automated embodiment, 
diethylphosphoramidiies are used as starting materials and can be synthesized as 

30 described by Beaucage et a!., 1981 , Tetrahedron Lcncrs 22: 1 859-1 862, One metiiod 
for synthesizing oligonucleotides on a modified solid suppon is described in U.S. 
Patent No. 4,458,066. One can also use a primer that has been isolated from a 
biological source (such as a restriction endonuclease digest). 
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No matter what primers are used, however, the reacUon ioixture must contain a 
tempMte for PGR to cccor. because the specific nucleic acid sequence is produced by 
using a nucleic acid containing that sequence as a template. The first step involves 
contacting each nucleic acid strand witl^ four different nucleoside triphosphates and two 
5 oligonucleotide primers for each specific nucleic add sequence being amplified or 
detected If the nucleic acids to be amplified or detected are DNA. then the nucleoside 
triphosphates are usually dATP. dCTP. dGTP. and dTIP. although various nucleoude 
derivatives can also be used in the process. The concentration of nucleoside 
tiiphosphates can vary widely. Typically, the concentration is 50 to 200 pM in each 
10 dNTP in the buffer for amplification, and MgCh is present in tiie buffer in an amount 
of 1 to 3 mM to activate the polymerase and increase the specificity of die reaction. 
However. dNTP concentrations of 1 to 20 may be preferred for some applications, 
such as DNA sequencing or generating radiolabeled probes at high specific activity. 
The nucleic acid strands of the target nucleic acid serve as templates for die 
1 5 synthesis of additional nucleic acid strands, which are extension products of die 
primers. This syntiiesis can be perf-ormed using any suitable method, but generally 
occurs in a buffered aqueous solution, preferably at a pH of 7 to 9. most preferably 
about 8. To faciUtate synthesis, a molar excess of the two oligonucleotide primers is 
added to the buffer containing the template strands. As a practical matter, die amount of 
20 primer added will generally be in molar excess over die amount of complementary 
strand (template) when the sequence to be amplified is contained in a mixture of 
compUcated long-chain n.icleic acid strands. A large molar excess is preferred to 
improve die efficiency of the process. Accordingly, primer-.templatc ratios of at least 
1000:1 or higher are generally employed for cloned DNA templates, and primer: 
25 template ratios of about 108: 1 or higher are generally employed for amplification from 
complex genomic samples. 

The mixture of template, primers, and nucleoside triphosphates is then treated 
according to whether die nucleic acids being amplified or detected are double- or single- 
stranded. If die nucleic acids are single-stranded, then no denaturation step need t>c . 
30 employed prior to die first extension cycle, and die reaction niixmre is held at a 
temperanire diat promotes hybridization of the primer to its complementary target 
(template) sequence. Such temperature is generally from about 35-C to 65*C or more, 
preferably about 3rC to 60'C for an effective time, generally from a few seconds to 
five minutes, preferably from 30 seconds to one minute. A hybridization temperature 
35 of 35'C to 70*C may be used for Ima DNA polymerase. Primers diat are 15 
nucleotides or longer in length are used to increase die specificity of primer 
hybridization. Shoner primers require lower hybridization temperatures. 

The complement to the original single-stranded nucleic acids can be syndiesized 
by adding Tma DNA polymerase in the presence of the appropriate buffer. dNTPs. and 
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one or more oligonucleotide primers. If an appropriate single primer is added, the 
primer extension product will be complementary to the single-stranded nucleic acid and 
will be hybridized with the nucleic acid strand in a duplex of strands of equal or 
unequal length (depending on where the primer hybridizes to the template), which may 

5 then be sepai^ated into single strands as described above to produce two single, 
separated, complementary strands. A second primer would then be added so that 
subsequent cycles of primer extension would occur using both the original single- 
strar-ded nucleic acid and the extension product of the first primer as templates. 
Alternatively, two or more appropriate primers (one of which will prime synthesis 

10 using the extension product of the other primer as a template) can be added to the single- 
stranded nucleic acid and the reaction carried out 

If the nucleic acid contains two strands, as in the case of amplification of a 
double-stranded target or second-cycle amplification of a single-stranded target, the 
strands of nucleic acid must be separated before the primers are hybridized. This strand 

15 separation can be accomplished by any suitable denaturing method, including physical, 
chemical or enzymatic means. One preferred physical method of separating the strands 
of the nucleic acid involves heating the nucleic acid until complete (>99%) dcnaturation 
occurs. Typical heat dcnaturation involves temperatures ranging from about 80'C to 
105'C for times generally ranging from about a few seconds to minutes, depending on 

20 the composition and size of the nucleic acid. Preferably, the effective denaturing 

temperature is 90'C-100*C for a few seconds to 1 minute. Strand separation may also 
be induced by an enzyme from the class of enzymes known as helicases or the enzyme 
RecA, which has hclicase activity and in the presence of riboATP is known to denature 
DNA. The reaction conditions suitable for separating the strands of nucleic acids with 

25 helicases are described by Kuhn Hoffmann-Berling, 1978, CSH-Ouantitarive Biology 
42:63, and techniques for using RecA are reviewed in Radding, 1982, Ann . Rev . 
Generics 1^:405-437. The denaturation produces two separated complementary strands 
"of equal or unequal length. 

If the double-stranded nucleic acid is denamred by lieat, the reaction niixturc iS 

30 allowed to cool to a temperature that promotes hybridization of each primer to the 
complementary target (template) sequence. This tcmperamre is usually from about 
35*C to 65*C or more, depending on reagents, preferably 37*C to 60*C. The 
hybridization temperature is maintained for an effective time, generally a few seconds to 
minutes, and preferably 10 seconds to 1 minute. In practical terms, the temperature is 

35 simply lowered ft'om about 95*C to as low as 37*C, and hybridization occurs at a 
temperature within this range. _ . 

Whether the nucleic acid is single- or double-stranded, the DNA polymerase 
from Thermotoga maririma can be added prior to or during the denaturation step or 
when the temperature is being reduced to or is in die range for promoting hybridization. 
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Although the thermostability of Jm polymerase allows one to add Tma polymerase to 
the reaction mixture at any time, one can substantially inhibit non-specific amplification 
by adding the polymerase to the reaction mixture at a point in time when the mbcture 
wiU not be cooled below the stringent hybridization temperamre. After hybridization, 

5 the reaction nibcture is tiien heated to or maintained at a temperature at which the activity 
of the enzyme is promoted or optimized, i.e., a temperature sufficient to increase the 
activity of the enzyme in facilitating synthesis of the primer extension products from the 
hybridized primer and template. The temperature must actually be sufficient to 
synthesize an extension product of each primer that is complementary to each nucleic 

10 acid template, but must not be so high as to denanu^ each extension product from its 
complementary template (i.e., the temperature is generally less tiian about 80*C to 
90*C). 

Depending on tiie nucleic acid(s) employed, the typical temperature effective for 
this synthesis reaction generally ranges from about 40*C to 80'C, preferably 50*C to 
15 75*C. The temperature more preferably ranges from about 65*C to 75*C for 

Thermotoga maririma DNA polymerase. The period of time required for this synthesis 
may range from about 10 seconds to several minutes or more, depending mainly on the 
temperature, the length of the nucleic acid, the enzyme, and the complexity of the 
nucleic acid mixture. The extension time is usually about 30 seconds to a few minutes. 
20 If the nucleic acid is longer, a longer time period is generally required for 
complementary strand synthesis. 

The newly synthesized strand and Uie complement nucleic acid sorand form a 
double-stranded molecuk. Uiat is used in the succeeding steps of the amplification 
process. In the next step, the strands of tiie double- stranded molecule are separated by 
25 heat denatuiation at a temperanire and for a time effective to denature tiie molecule, but 
not at a temperature and for a period so long that die thermostable enzyme is completely 
and irreversibly denatured or inactivated. After this denaturation of template, the 
temperature is decreased to a level that promotes hybridization of tiie primer to tiie 
complementary single-stranded molecule (template) produced from the previous step, 

30 as described above. 

After tills hybridization step, or concurrently with tiie hybridization step, tiie 
temperature is adjusted to a temperature tiiat is effective to promote die activity of tiie 
tiiermostSDle enzyme to enable synthesis of a primer extension product using as a 
template both the newly synUiesized and the original strands. The temperature again 

35 must not be so high as to separate (denature) the extension product from its template, as 
described above. Hybridization may occur during this step, so that the previous step of 
cooling after denaturation is not required. In such a case, using simultaneous steps, die 
preferred temperature range is 50*C to 70*C. 
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The heating and cooling steps involved in one cycle of strand separation, 
hybridization, and extension product synthesis can be repeated as many times as needed 
to produce the desired quantity of the specific nucleic acid sequence. The only 
limitation is the amount of the primers, thermostable enzyme, and nucleoside 

5 triphosphates present. Usually, from 15 to 30 cycles are completed. For diagnostic 
detection of amplified DNA, the number of cycles will depend on the nature of the 
sample and the initial target concentration in the sample. For example, fewer cycles 
will be required if the sample being amplified is pure. If the sample is a complex 
mixture of nucleic acids, more cycles will be required to amplify the signal sufficienUy 

10 for detection. For general amplification and detection, the process is repeated Lbout 15 
times. When amplification is used to generate sequences to be detected with labeled 
sequence-specific probes and when human genomic DNA is the target of amplification, 
the process is repeated 15 to 30 times to amplify the ^sequence sufficientiy so that a 
clearly detectable signal is produced, i.e., so that background noise does not interfere 

15 with detection. 

No additional nucleosides, primers, or thermostable enzyme need be added after 
the initial addition, provided that no key reagent has been exhausted and that the 
enzyme has not become denatured or irreversibly inactivated, in which case addition j! 
polymerase or other reagent would have to be added for the reaction to continue, 

20 Addition of such materials at each step, however, will not adversely affect the reaction. 
After the appropriate number of cycles has been completed to produce the desired 
amount of the specific nucleic acid sequence, the reaction can be halted in the usual 
manner, e.g., by inactivating the enzyme by adding EDTA, phenol, SDS, or CHCI3 or 
by separating the components of the reaction. 

25 The amplification process can be conducted continuously. In one embodiment 

of an automated process, the reaction nuxture can be temperature cycled such that the 
temperature is programmed to be controlled at a certain level for a certain time. One 
such instrument for this purpose is the automated machine for handling the _ 
amplification reaction developed and marketed by Perkin-Elmer Cetus Instruments. 

30 Detailed instructions for carrying out PCR with the instrument are available upon 
purchase of the instrument. 

Tma DNA polymerase is very useful in the diverse processes in which 
amplification of a nucleic acid sequence by the polymerase chain reaction is useful. The 
amplification method may be utilized to clone a particular nucleic acid sequence for 

35 insertion into a suitable expression vector, as described in U.S. Patent No. 4,800,159. 
The vector may be used to transfonm an appropriate host organism to produce the gene 
product of the sequence by standard methods of recombinant DNA technology. Such 
cloning may involve direct ligation into a vector using blunt-end ligation, or use of 
restriction enzymes to cleave at sites contained within the primers. Other processes 



wo 92/03556 



PCT/US91/057S3 



32 

suitable for Tma polymerase include those described in U.S. Patent Nos. 4,683,195 
and 4,683,202 and European Patent Publication Nos. 229.701; 237,362; and 258,017; 
these patents and publications are incorporated herein by reference. In addition, the 
present enzyme is useful in asymmetric PGR (see Gyllensten and Eriich, 1988, hSSL- 
5 Hail- Acad- Ssi- iiSA S5:7652-7656, incorporated herein by reference); inverse PGR 
(Ochman si al-. 1988, Genetics 12Q:62l, incorporated herein by reference); and for 
DNA sequencing (see Innis £tal.. 1988, EeQC. NmI- Agad- Sci- USA £5:9436-9440, 
and McGonlogue £i al-. 1988, Hu£. Acids EfiS- M20):9869). Ima polymerase is also 
believed to have reverse transcriptase activity; see PGT Patent Publication No. 
10 91/09944, published July 11, 1991, incorporated herein by reference. 

The reverse transcriptase activity of Ima DNA polymerase permits this enzyme 
to be used in methods for transcribing and amplifying RNA. The improvement of such 
methods resides in the use of a single enzyme, whereas previous methods have 
required more than one enzyme. 
1 5 The improved methods comprise the steps of: (a) combining an RNA template 

with a suitable primer under conditions v/hereby the primer will anneal to the 
corresponding RNA template; and (b) reverse transcribing the RNA template by 
incubating the annealed primer-RNA template mixture with Tnoa DNA polymerase 
under conditions sufficient for the DNA polymerase to catalyze the polymerization of 
20 deoxyribonucleoside triphosphates to form a DNA sequence complementary to the 
sequence of the RNA template. 

In another aspect of the above method, the primer that anneals to the RNA 
template may also be suitable for amplification by PGR. In PGR, a second primer that 
is complementary to the reverse transcribed cDNA strand provides a site for inidation of 
25 synthesis of an extension product. As ahready discussed above, the Tma DNA 
polymerase is able to catalyze this extension reaction on a cDNA template. 

In the amplification of an RNA molecule by TtEa DNA polymerase, the first 

extension reaction is reverse transcription, in which a DNA strand is produced in the 

form of an RNA/cDNA hybrid molecule. The second extension reaction, using U-.e 
30 DNA strand as a template, produces a double-stranded DNA molecule. Thus, 

synthesis of a complementary DNA strand from an RNA template with Tma DNA 

polymerase provides the starting material for amplification by PGR. 

When Tma DNA polymerase is used for nucleic acid transcription from an RNA 

template, the use of buffers that contain Mtfi* provide improved stimulation of Ima 
35 reverse transcriptase activity compared to previously used, Mg2+ containing reverse 

transcription buffers. GonsequenUy, increased cDNA yields also result from these 

methods. 

As stated above, the product of RNA transcription by Tma DNA polymerase is 
an RNA/cDNA hybrid molecule. The RNA is then removed by heat denaturation or 
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any number of other known methods including alkali, heat, or enzyme treatment. The 
remaining cDNA strand then serves as a template for polymerizarion of a self- 
complementary strand, thereby providing a double-stranded cDNA molecule suitable 
for ampUfication or other manipulauon. The second strand synthesis requires a 
5 sequence specific primer and Tma DNA polymerase. 

Following the synthesis of the second cDNA strand, the resultant double- 
stranded cDNA molecule can serve a number of purposes, including DNA sequencing, 
amplification by PGR. or detection of a specific nucleic acid sequence. Specific 
primers useful for amplification of a segment of the cDNA can be added subsequent to 
10 the reverse transcription. Also, one can use a first set of primers to synthesize a 

specific cDNA molecule and a second nested set of primers to amplify a desired cDNA 
segment. All of these reactions are catalyzed by Tma DNA polymerase. 

Tma DNA polynjerase can also be used to simplify and improve methods for 
detection of RNA target molecules in a sampl(*.. In these methods, Tma DNA 
15 polymerase catalyzes: (a) reverse transcription; (b) second strand cDNA synthesis; and, 
if desired (c) amplification by PGR. In addition to the improvement of only requiring a 
single enzyme, the use of Ima DNA polymerase in the described methods eliminates 
the previous requirement of two sets of incubation conditions that were necessary due 
to the use of different enzymes for each procedural step. The use of Tma DNA 
20 polymerase provides RNA transcription and amplification of the resulting 

complementary DNA with enhanced specificity and witii fewer steps than previous 
RNA cloning and diagnostic methods. These methods are adaptable for use in kits for 
laboratory or clinical analysis. 

The RNA Uiat is transcribed and amplified in the above methods can be derived 
25 from a number of sources. The RNA template can be contained within a nucleic acid 
preparation from any organism, such as a viral or bacterial nucleic acid preparation. 
The preparation can contain cell debris and other components, purified total RNA, or 
purified mRNA. The RNA template can also be a population of heterogeneous RNA 
molecules in a sample. Funhemiore, the target RNA can be contained in a biological 
30 sample, and the sample can be a heterogeneous sample in which RNA is but a small 
portion. Examples of such biological samples include blood samples and biopsied 
tissue samples. 

Although the primers used in the reverse. transcription step of the above 
methods are generally completely complementary to the RNA template, the primers 
35 need not be completely complementary. As in PGR, not every nucleotide of the primer 
must anneal to the template/or reverse transcription to occur. For example, a non- 
complementary nucleotide sequence can be present at the 5* end of the primer with the 
remainder of the primer sequence being complementary to the RNA. Alternatively, non- 
complementary bases can be interspersed into the primer, provided that the primer 
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sequence has sufficient complementarity with the RNA template for hybridization to 
occur and allow synthesis of a complementary DNA strand. 

The following examples are offered by way of illustration only and are by no 
means intended to limit the scope of the claimed invendon. In these examples, all 
5 percentages are by weight if for soUds and by volume if for liquids, unless otherwise 
noted, and all temperatures are given in degrees Celsius. 

EXAMPLE 1 

Piirificarinn of Th ?rmntnpa maritima 
p^TA Polymerase 

10 This example describes the isoladon of Ima DNA polymerase from Th^-rmQm^ 

maritima . The DNA polymerase was assayed at various points during purification 
according to the method described for laa polymerase with one modification (1 mM 
MgCl2) in Lawyer £i al., 1989, L BIqI. Chem . IMd l):6427-6437. incorporated 
herein by reference. 

15 Typically, this assay is performed in a total volume of 50 ^1 of a reaction 

mixture composed of 25 mM TAPS-HCl. pH 9.5 (20*C); 50 mM KCl; 1 mM MgCb; 1 
mM p-mercaptoethanol; 200 ^M in each of dATP. dGTP, and TTP; 100 ^iM a-32P- 
dCTP (0.03 to 0.07 |iCi/nMol); 12.5 Hg of activated salmon sperm DNA; and 
polymerase. The reaction is initiated by addition of polymerase in diluent (diluent is 
20 composed of 10 mM Tris-HCl, pH 8.0. 50 mM KCl. 0. 1 mM EDTA, 1 mg/ml 

autoclaved gelatin. 0.5% NP40. 0.5% Tween 20, and 1 mM p-mercaptoetiianol). and 
the reaction is carried out at 75*C. For the calculations shown below, one assumes that 
the volume of the polymerase (and diluenO added is 5 ^1. and die total reaction volume 
is 50 ^il. After a 10 minute incubation, die reaction is stopped by adding 10 jil of 60 
25 mM EDTA. The reaction mbcture is centrifuged, and 50 ^1 of reaction mixture is 
transfeired to 1 .0 ml of 50 ng/ml earner DNA in 2 mM EDTA (at 0*C). An equal 
volume (1 ml) of 20% TCA. 2% sodium pyrophosphate is added and mbced. The 
mixture is incubated at O'C for 15 to 20 minutes and then filtered through Whatman 
GF/C filters and extensively washed (6 x 5 ml) with a cold mixture containing 5% TCA 
30 and 1 % pyrophosphate, followed by a cold 95% etiianol wash. The fUters are then 
dried and die radioactivity counted. Background (minus enzyme) is usually 0.001% to 
0.01% of input cpm. About 50 to 250 pmoles 32p.dCTP standard is spotted for unit 
calculation. One unit is equal to 10 nmoles dNTP incorporated in 30 minutes at 75'C. 
Units are calculated as follows. 

35 ^ 
«;amplc cpm - enzvme dil. cpm = pmole dCTP incorporated 
specific activity of dCTIP 
(cpm/pmolc) 
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pmole incorporated x 3 x dilution factor x 4 = units/ml 
4.167x10 

The 4.167 factor results fronri counting only 5/6 (50 of the reaction volume after the 
stop solution is added (60 
5 All operations were carried out at 0*C to 4*C unless otherwise stated. All 

glassware was baked prior to use, and solutions used in the purification were 
autoclaved, if possible, prior to use. 

About 50 g of frozen Thermotoga maritima strain MSB8 cells (provided by 
Prof, Dr, K. O. S tetter, Regensburg. Germany) were thawed in 25 ml of 3 x TE-DTT 

10 buffer (150 mM Tris-Cl, pH 7.5. 3 mM EDTA, and 3 mM dithioihreitol) containing 
2.4 mM PMSF (from 144 mM stock in DMF) and homogenized at low speed with a 
magnetic stiirer. The thawed cells were lysed in an Aminco french pressure cell (8 - 
20,000 psi). The lysate was diluted with additional 1 x TE-DTF buffer containing 
fresh 2.4 mM PMSF to final 5.5x cell wet weight and sonicated lo reduce viscosity (40 

15 to 100% output. 9 min., 50% duty cycle). 

The resulting fraction, fraction I (275 ml) contained 5.31 g of protein and 
15.5 x ICH units of activity. Ammonium sulfate was added to 0.2 M (7.25 g) and the 
lysate stirred for 15 minutes on ice. Ammonium sulfate prevents the Tma DNA 
polymerase from binding to DNA in the crude lysate and reduces ionic interactions of 

20 the DNA polymerase with other cell lysate proteins. 

Empirical testing showed that 0.2% Polymin P (polyeihyleneimine, PEI) 
precipitates >92% of the lotai nucleic acid. Polymin P (pH 7,5) was added slowly to 
0.2% (5.49 ml of 10% PEI) and the slurry stirred 30 minutes on ice, then centrifuged at 
30,000 X g at 4*C for 30 minutes. The supernatant was designated fracdon II (246 ml) 

25 and contained 3.05 g of protein and 12.5 x 10^ units of activity. 

Fraction II was adjusted to 0.3 M ammonium sulfate by addition of 3.24 g solid 
ammonium sulfate to ensure complete binding of the DNA polymerase to phenyl 
sepharose. Fraction II was then loaded onto a 2.2 x 6.6 cm (25 ml) phenyl sepharose 
CL-4B (lot OM 08012, purchased from Pharmacia - LKB) column (equilibrated in 7E 

30 containing 0.3 M ammonium sulfate and 0.5 mM DTT; at 38 ml/hr (10 ml/'cm^/lif). All 
resins were equilibrated and recycled according to the manufacturer's 
recommendations. 

The column was washed with 150 ml of the same buffer (A2go to baseline), 
tiien with 90 ml TE containing 0.5 mM DTT (no ammonium sulfate), followed by a 
35 wash with 95 ml of 20% ethylene glycol in TE containing 0.5 mM DTT and finally, 
eluted with 2 M urea in TE containing 20% ethylene glycol and 0.5 mM DTT. When 
the column fractions were assayed, a large proponion of the activity was found in the - 
flow-tiirough and wash fractions, indicating that the capacity of the column had been 
exceeded. Approximately 70% of the DNA polymerase which had bound to this first 
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phenyl sepharose column eluted at low salt (with the TE-DTT wash), and the balance of 
the bound material eluted with 2 M urea in 20% ethylene glycol in TE-DTT wash. 

The flow-through activity from the first phenyl sepharose column was 
designated PSH load (226 ml) and contained 1 .76 g protein. Fraction PSII load was 
5 applied to a second phenyl sepharose column (of the same lot and dimensions), and the 
run was repeated the same way. Again, the capacity of the column was exceeded, and 
activity was found to elute with both the low salt and 2 M urea washes. Only 10% of 
the bound DNA polymerase eluted with the TE-DTT wash; the major portion (-90%) 
eluted with the 2 M urea in 20% ethylene glycol in TE-DTT wash. 
10 The flow-through activity from the second phenyl sepharose column was 

combined v/ith the TE-DTT eluates from the first and second phenyl sepharose columns 
and adjusted to 0.3 M ammonium sulfate. This fraction (PSDI load. 259.4 ml) 
contained 831 mg protein and was reapplied to a third phenyl sepharose column of 50 
ml bed volume at 10 mVcm^/hr. This time, all of the applied activity- was retained by 
1 5 the column and only eluted with the 2 M urea in 20% ethylene glycol in TE-DTT wash. 
All three urea eluates were separately concentrated ~3 to 4 fold on Amicon 
YM30 membranes and dialyzed into heparin sepharose loading buffer shortly after 
elution to avoid prolonged exposure to urea (to avoid carbamylation). The dialyzed and 
concentrated urea eluates were assayed for protein concentration and were found to 
20 vary greaUy in their specific activity. Because the urea eluate from the second phenyl 
sepharose column contained the majority of the activity at significantly higher specitlc 
activity (~8 x 10^ units of activity at -1000 units/mg protein) than the other two eluates, 
it was processed sepa'ately from them. 

The dialyzed and concentrated phenyl sepharose n urea eluate was applied to a 
25 5 ml bed volume heparin sepharose CL 6B (purchased from Pharmacia - LKB) column 
that had been equilibrated with 0.08 M KCl, 50 mM Tris-Cl. pH 7.5, 0.1 mM EDTA, 
0.2% Tween 20. and 0.5 mM DTT. This column and all subsequent colunuis were run 
at 1 bed volume per hr. All of the applied DNA polymerase activity was retained by the 
column. The column was washed with 17 ml of the same buffer (Ajso to baseline) .ind 
30 eluted with 60 ml of a linear 80 to 500 mM KCl gradient in the same buffer. 

Fractions (0.53 ml) eluting between 0.21 and 0.315 M KCl were analyzed by 
SDS-PAGE. The peak fractions eluting between 0.225 and 0.275 M KCl were pooled 
separately. The flanking fractions were kept to be combined later v.dth other fractions. 
The pool of peak fractions (affigel I load) was diluted with affigel-blue buffer Nvithout 
35 KCl to reduce its ionic strength to 0. 1 5 M KCl . 

The affigel I load fraction contained 3.4 mg of protein and was applied to a 4.3 
ml affigel-blue (purchased from BioRad) column, which had been equilibrated in 25 
mM Tris-Cl, pH 7.5. 0.1 mM EDTA. 0.2% Tween 20, 0.5 mM D 1 1 , ai.d 0.15 M 
KCl. AU of the applied Ima DNA polymerase was retained. The column was washed 



wo 92/03556 



PCr/US91/05753 



37 

with 15 ml of the same buffer aiid eluted with a 66 ml linear 0.15 to 0.7 M KCl 
gradient in the same buffer. 

Fractions (0.58 ml) eluting between 0.34 and 0.55 M KCl were analyzed by 
SDS-PAGE and appeared to be >90% pure. The polymerase peak fractions were no 
5 longer contaminated with site-specific endonuclease (indicated by absence of lower- 
molecular-weight specific DNA fragments after one or twenty-two hours incubation at 
65'C with 2 units of Tma polymerase using 600 ng of plasmid pLSG 1 (ccc-DNA)). 
The polymerase peak fractions eluting between 0.3 and 0.55 M were pooled and 
concentrated ~20-fold on an Amicon YM 30 membrane. This fraction was then 

10 diafiUered into 2.5 x storage buffer (50 mM Tris-Cl. pH 7.5, 250 mM KC!. 0.25 mM 
EDTA, 2.5 mM DTT, and 0.5% Tween 20 [Pierce. Surf act-Amps]) and stored at 4*C. 

The urea eluates from the first and third phenyl sepharose columns were 
combined with the flanking firactions from the first heparin sepharose column. This 
pool (HSn load) contained -200 mg protein and was diluted with heparin sepharose 

! 5 buffer without KCl to adjust its ionic strength to 80 mM KCl. HSII load was applied 
to a 16 ml bed volume heparin sepharose column (equilibrated in 80 mM KCl, 50 mM 
Tris-Cl, pH 7.5, 0.1 mM EDTA, 0.2% Tween 20, and 0.5 mM DTT). No detectable 
polymerase activity appeared in the flow-through fractions. 

The column was washed with 80 ml of the same buffer and eluted with a 200 

20 ml linear 80 to 750 mM KCl gradient in the same buffer. Fractions (2 ml) eluting 

between 0.225 and 0.335 M KCl were combined, concentrated ~5-fold on an Amicon 
YM 30 membrane, and dialyzed into hydroxyapatite-buffer. This fraction (HA load) 
contained 9.3 mg protein and was loaded onto a 4 ml bed volume hydroxyapatite (high 
resolution HPT, purchased from Calbiochem) column that had been equilibrated in 10 

25 mM potassium phosphate buffer, pH 7.5, 0.5 mM DTT, 0. 1 mM EDTA, and 0.2% 
Tween 20. All of the applied DNA polymerase activity was retained by the column. 

The column was washed with 12 ml of the same buffer and eluted with a 60 ml 
linear 10 to 500 mM potassium phosphate (pH 7.5) gradient. Fractions (0.8 ml) 
eluting between 0.105 and 0.230 M pota.ssium phosphate were analyzed by SDS- 

30 PAGE. Compared to the affigel column I load fraction (which by SDS-PAGE 

appeared lo be -10 to 20% pure), these fractions were ~5-fold less pure. The DNA 
polymerase peak fractions eluting between 0.105 and 0.255 M potassium phosphate 
were combined, concentrated ~3-fo!d on an .Amicon YM 30 membrane, and diafiltered 
into affigel-blue buffer. 

35 The affigel II load fraction was applied to a 3 ml bed volume affigel-blue 

column that had been equilibrated in affigel-blue buffer. No detectable DNA 
polymerase'activity appeared in the flow-through fractions. The column was washed 
with 9 ml of the same buffer and eluted with a 50 ml linear 0.2 to 0.7 M KCl gradient 
in the same buffer. Fractions (0.58 ml) eluting between 0.33 and 0.505 M KCl were 
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analyzed by SDS-PAGE. Because the earlier eluting fractions looked sUghUy cleaner 
by their sUver staining pattern, two pools were made. Fractions eluting between 0.31 
and 0.4 M KCl were combined into pool I; fracuons eluting between 0.4 and 0.515 M 
KCl were combined into pool U. The two pools were each separately concentrated ~7- 
5 fold on an Amicon Y1.'1 30 membrane. 

AU three affigel-blue pools sdU contained high levels of contaminating, non- 
specific nucleases. Upon incubation at IQ'C with 1.5 units of DNA polymerase, botii a 
single-strand M13 DNA template and a multifragment restriction digest of a plasmid 
were degraded within a few hours. In sim-activity gels were run and showed that the 
1 0 DNA polymerase fractions had not suffered proteolytic degradation. 

Tne two pools from the second affigel-blue column were combined and 
dialyzed into a phosphocellulose column buffer. The dialyzed fraction (Pll I load) was 
loaded onto a 3 ml phosphocellulose column, which had been washed overnight with 
25 mM Tris-Cl, pH 7.5. 50 mM KCl. 0.1 mM EDTA, 0.2% Tween 20, and 0.5 mM 
1 5 Dir. This wash later proved lo have been insufficient to equilibrate the pH of the 
phosphocellulose resin. Unfortunately, this was discovered after the sample had been 
loaded onto die column. All of the applied activity bound to the column. 

The column was washed with 9 ml of loading buffer and eluted with a 45 ml 
linear 50 to 700 mM KCl gradient. DNA polymerase peak fracuons (0.58 ml) eluting 
20 between 0.46 and 0.575 M KCl were analyzed by SDS-PAGE. 

Separation of contaminating proteins was observed throughout the peak: a -45 
kDa contaminating band elutes at 0.53 M KCl; an -85 kDa contaminating band has an 
elution peak at 0.54 M KCl. Therefore, this column was repeated (loading at 
somewhat higher ionic strength considering the elution profile of the polymerase). The 
25 peak fractions, eluting between 0.475 and 0.56 M KCl from the first phosphoceUulose 
column were combined with the pool from the fu^i affigel column. The combined 
fraction (Pll II load) now contained all of the purified polymerase (-7.5 x 10* units). 

Fraction Pll II load was diluted with phosphocellulose buffer to adjust its ionic 
strength to 0.2 M KCl. Pll 11 load was loaded onto a 9 ml bed volume 
30 phosphocellulose column, which, this time, had been equilibrated to the correct pH and 
ionic strength of 25 mM Tris-Cl. pH 7.5, 200 mM KCl. 0.1 mM EDTA. 0.2% Tween 
20. and 0.5 mM DTT. The column was washed with 27 ml of the same buffer and was 
intended to be eluted with a 140 ml linear 0.2 to 0.8 M KCl gradient. However, 
instead of an upper limit buffer of 0.8 M KCl. the buffer had a concentration of 52 mM 
35 KCl which resulted in a gradient decreasing in salt. The column was then 

reequiUbrated with 32 ml of 0.2 M KCl-phosphocellulose buffer, and the 140 ml linear 
0.2 to 0.8 M KCl gradient was reapplied. 

The routine assays of fiow-through, wash, and gradient fractions :>howe'i that, 
at tiiis higher pH (pH 7.5), the DNA polymerase does not bind to the phosphocellulose 
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resin at 0.2 M KCl. The DNA polymerase activity containing fractions from the flow- 
through, wash, and decreasing salt-gradient-fractions were combined. The resulting 
pool was concentrated on an Amicon YM30 membrane. However, a mishap with the 
concentrator led to further losses of DNA polymerase activity. The recovered activity 
5 was dialyzed into phosphocellulose buffer with 50 mM KCl and designated PU in load. 

This fraction was loaded onto a 5 ml bed volume phosphocellulose column that 
had been equilibrated with phosphocellulose buffer with 50 mM KCl. All of the 
applied activity was retained by the column. The column was washed with 15 ml of the 
same buffer and eluted with a 45 ml linear 50 to 500 mM KCl gradient in the same , 

10 buffer. Fractions (0.87 ml) eluting between 0.16 and 0.33 M KCl were analyzed by 
SDS-PAGE and in silu activity gels. 

Based on the silver staining pattern, two pools were made. The peak fractions, 
eluting between 0.215 and 0.31 M KCl, were kept separate from the leading and 
trailing fractions, which were combined into a side-fractions pool. Both pools were 

15 concentrated on centricon 30 membranes and diafiltered into 2.5x storage buffer (50 
mM Tris-HCl, pH 7.5, 250 mM KCl, 0.25 mM EDTA, 2.5 mM DTT, and 0.5% 
Tween 20 [Pierce, Surfact-AmpsJ) and subsequently mixed with 1.5 volumes of 80% 
glycerol. 

About 3.1 X 10^ units were recovered in the peak fraction; the side pool yields 
20 an additional 1 x 103 units of activity. The purified DNA polymerase was undegraded 
as evidenced by an unchanged migration pattern in an in sim activity gel. The 
molecular weight as determined by gel electrophoresis of the purified DNA polymerase 
is approximately 97 kDa. Tma DNA polymerase is recognized by epitope-specific 
antibodies that correspond to Tag DNA polymerase amino acid residues number 569 
25 through 587 (DGTPl) and 718 through 732 (DGTP3). 

EXAMPLE 2 

Isolation of DNA Encoding Tma 
DNA Polymerase I Activity 
Synthetic oligodeoxyribonucleotides DG164 tiirough DG167 are four different 
30 16- fold degenerate (each) 22mer pools designed as "forward*' primers to one of tiie 
motifs in the template binding domains (3'-mosi 14 nucleotides) of thermostable DNA 
polymerases. This motif is the amino acid sequence Gly-Tyr-Val-Glu-Thr and 
corresponds identically to tiie I. aquaticus (Tag ) DNA polymerase amino acids 718 
through 722 and to die T. ihermophilus CTxh) DNA polymerase amino acids 720 
35 through 724. This motif is found in a DNA polymerase gene in all Therm us species. 
The combined primer pool is 64-fold degenerate, and the primers encode a Bgl ll 
recognition sequence at their 5'-ends. 

Forward primers DG164 through DG167 are shown below: 
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DG164 SEQIDN0:2 5"CGAGATCTGGNTAYGTWGAAAC 
DG165 SEQ ID NO: 3 5'CGAGATCTGGNTAYGTWGAGAC 
DG166 SEQ ID NO: 4 5'CGAGATCTGGNTAYGTSGAAAC 
DG167 SEQ ID NO: 5 5'CGAGATCTGGNTAYGTSGAGAC 
5 In these forward primers: A is Adenine; C is Cytidine; G is Guanidine; T is Thymine; 
Y is C+T (pYrimidine); S is G+C (Strong interaction; 3 H-bonds); W is A+T (Weak 
interacuon; 2 H-bonds); and N is A+C+G+T (aNy). 

Synthetic oligodeoxyribonucleotides DG160 through DG 163 are four different 
8-fold degenerate (each) 20mer pools designed as "reverse" primers to one of the 
10 motifs in the template binding domains (3'-most 14 nucleotides) of thermostable DNA 
polymerases. These primers are designed to complement the (-H)-strand DNA sequence 
that encodes the motif Gln-Val-His-Asp-Glu and that corresponds idcndcally to the Jm. 
DNA polymerase amino adds 782 through 786 and to the Tm DNA polymerase amino 
acids 784 through 788. This motif is found in a DNA polymerase gene in all BermM 
1 5 species. The combined primer pool is 32- fold degenerate, and the primers encode an 
Eco RI recognition sequence at their 5'-ends. 

Reverse primers DG160 through 163 are shown below: 
DG160 SEQ ID NO: 6 5'CGGAATTCRTCRTGWACCTG 
DG 161 SEQ ID NO: 7 5"CGG AATTCRTCRTGWACTTG 
DG162 SEQ ID NO: 8 S'CGGAATTCRTCRTGSACCTG 
DG163 SEQ ID NO: 9 5'CGGAATTCRTCRTGSACITG 
In these reverse primers A. C. G. T. S. and W are as defined above, and R is G+A 
(puRine). 

To amplify an -230bp fragment of the Ima DNA polymerase gene, a PGR 
25 amplification tube was prepared without MgCli that contained in 80 ^ll: (1) 5 ng 

denatured Tma genomic DNA; (2) 50 pmoles (total) of the combined forward primer set 
DG164-DG167; (3) 50 pmoles (total) of the combined reverse primer set DG160- 
DG163; (4) 2 units lafl DNA polymerase; (5) 50 ^iM each (final) dNTP; (6) 0.05% 
Uurcth-12; and (7) standard PGR buffer except no magnesium chloride. 
30 The sample was flash-frozen at -70*C and then stored at -20*C. The frozen 

sample was carefully layered with 20 \i\ of lOmM MgCh (final concentration 2 mM). 
immediately overiayed with 50 nl of mineral oil, and cycled in a Perkin Elmer Cetus 
Tliemial Cycler according to ilie following file: (i) step to 9Z'C - hold 50 seconds; (2) 
step to 50-C - hold 10 seconds; (3) ramp to 75-C over 4 minutes; and (4) step to 
35 The file was repeated for a total of 30 cycles. One-fifth (20 ^1) of the amplification 
product was purified on a 3% Nusieve/1% Seakem agarose composite gel, and the 
approximately 230 bp fragment was eluted, concentrated, and digested with BglU and 
EcoRI. 



20 



. wo 92/03556 



PCr/US91/05753 



41 

Synthetic oligodeoxyribonucleotides DG154 and DG155 are two different 32- 
fold degenerate (each) 19mer pools designed as "forward" priiiiers to one of the nnotifs 
in the primer: template binding domains (3'-most 1 1 nucleotides) of thermostable DNA 
polymerases. This motif is the tetrapeptide sequence Tnr-Ala-Thr-Gly and corresponds 
5 identically to the Tag DNA polymerase amino acids 569 through 572 and to Iih DNA 
polymerase amino acids 57 1 through 574. This motif is found in a DNA polymerase 
gene in all Themius species. The combined primer pool is 64-fold degenerate and the 
primers encode a Bgi n recognition sequence at their 5'-ends, 

Forv/ard primers DO 154 and DO 155 are presented below: 

10 DG154 SEQ ID NO: 10 CGAGATCTACNGCNACWGG 
DG155 SEQ ID NO: 11 CGAGATCTACNGCNACSGG 
In these forward primers, A, C, G, T, S, W, and N are as defined above. 

To amplify an approximately -^67 bp fragment of the Jnm DNA polymerase 
gene, a PGR amplification tube was prepared without MgCl2 that contained, in 80 ^il: 

15 (1) 5 ng denatured Tma genomic DNA; (2) 50 pmoles (total) of the combined forward 
primer set DG154-DG155; (3) 50 pmoles (lotal) of the combined reverse primer set 
DG160-DG163; (4) 2 Units of lag DNA polymerase; (5) 50 each (final) dNTP; (6) 
0.05% Laureth 12; and (7) standard PGR buffer except no magnesium chloride. 

The sample was flash-frozen at -70*C and then stored at -20*C. The frozen 

20 sample was carefully layered with 20 ^xl of 10 mM MgCl2 (final concentration 2 mM), 
immediately overlayed with 50 jil of mineral oil, and cycled in a Perkin Elmer Gems 
Thermal Gycler according to the following file: (1) step to 98*C - hold 50 seconds; (2) 
step to 55'C - hold 10 seconds; (3) ramp to 75'C over 4 minutes; (4) step to 98*C The 
file was repeated for a total of 30 cycles, 

25 One-tlfth (20 of the amplification product was piuified on a 1.5% agarose 

gel, and the approximately 670 bp fragment was eluted, concentrated, and digested 
with Bglll and EcqRI as above. 

These amplification reactions yielded a 667 bp fragment and a 230 bp fragment, 
which was a subfragment of the 667 bp fragment. Tnese fragments proved useful in 

30 obtaining the complete coding sequence for the Tma DNA polymerase I gene, as 
described in the following example. 

EXAMPLES 

Cloning the Thermotopa maritima (Tma l 
DNA Polymerase I Gene 
This Example describes the strategy and methodology for cloning the Tma DNA 
polymerase I (Tma Pol I) gene of Thenmotoga maritima . 

The DNA sequences of the PGR products generated with primers DG 164- 167 
andDG160-163 (230 bp) and DG154. 155 and DG160-163 (667 bp) contain an Xma l 
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restriction site recognition sequence, 5'CCCGGG. Oligonucleotides were designed to 
hybridize to sequences upstream and downstream of the 2CmaI site. DG224 is a 21mer, 
homologous to the PGR products 59-79 bp 3-distal to the 2CmaI site. DG225 is a 
22mer. homologous to the PGR products from the site to 21bp upstream (5') of 
5 the JTjmI site. The sequence of DG224 and of DG225 is shown below (K is G or T). 
DG224 SEQ ID NO: 12 5'ACAGCAGCKGATATAATAAAG 
DG225 SEQ ID NO: 13 5'GCCATGAGCTGTGGTATGTCTC 
DG224 and DG225 were labelled by taiUng with biotin-dUTP and terminal transferase 
in reactions designed to add approximately 8 biotin-dUMP residues to the 3*-end of 
10 oUgonucleotides. These labelled oligonucleotides were used as probes in Southern blot 
analyses of restriction digests of genomic Ima DNA. A preUminary restriction map 
was generated based on the Southern analysis results, and the DNA sequences of the 
PGR products that were generated as described in Example 2. 

The preliminary map showed that the entire Ima DNA polymerase gene is 
15 contained in two Xmal fragments. Most of tiie gene, including the 5'-end, resides on 
an approximately 2.6 kb 2<IMI fragment. The remainder of the gene (and the 3*-end) 
resides on an approximately 4.2 kb 2<iml fragment. The two 2Cm2l fragments 
containing the entire Tma DNA polymerase gene were cloned into plasmid pBS13+ 
(also called pBSM13+) as described below. 
20 About 40 micrograms of Tma genomic DNA were digested to completion with 

Xmal . The 2CmaI digest was size-fractionated via electroelution. Slot blot analyses of 
a smaU portion of each fraction, using Y-32P-ATP-kinased DG224 and DG225 probes, 
identified the fractions containing the 4.2 kb 3'-fragment (hybridizing with DG224) and 
the 2.6 kb 5'-fragment (hybridizing with DG225). Fractions were concentrated via 
25 ethanol precipitation and then ligated with 2<mal-digested pBS 1 3+ (Stratagene). 

Ampicillin-resistani transformants were selected on nitrocellulose filters and the fUters 
probed with Y-32p-ATP-kinased DG224 or DG225 probe as appropriate. Plasmid 
DNA was isolated from colonies tiiat hybridized with probe. Restriction analysis was 
performed to confirm that fragments were as expected and to determine orientation of 
30 fragments relative to the pBS 13+ vector. 

DNA sequence analysis of the cloned fragments was performed using the 
"universal" and "reverse" sequencing primers (which prime in the vector, outside the 
restriction site polylinker region). In addition, for 5'-clones, the primers used to 
determine the DNA sequence of the DG154-155/DG160-163 667 bp PGR clone were 
35 employed. Preliminary DNA sequence analysis confirmed that the desired DNA 
fragments containing the Ima DNA polymerase gene had been cloned. 

From the preliminary DNA sequence, further sequencing primers were 

designed to obtain DNA sequence of more internal regions of the fragm.ents. Ln 
addition, to facilitate DNA sequence analysis, several deletions of die two }<mal 
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fragments were made. For both orientations of the 2.6 kb 5*-fragment, EcoRI, SacI, 
and Xbal digests were each diluted and ligated under conditions that favored 
intramolecular ligation, thus deleting DNA between the vector EcoRI. Sac L and Xbal 
sites and the corresponding sites in the Tma Xma l fragment. Such internal deletions 
5 allow ready DNA sequence analysis using the "universal" or "reverse" sequencing 
primers. 

Sim:iarly, a deletion of the 4.2 kb 3'-fragment was made, fusing the BamHI 
site of the vector with the Bgin site approximately 650 bp from the Tma Pol I internal 
Xma l site in that clone (Bam HI and Bglll have identical GATC cohesive ends that 

10 ligate readily with one another). This deletion allows for DNA sequence analysis of the 
3*-end of the Ima Pol I gene. 

Restriction site analysis reveals that both the 2,6 kb 5*-fragment and the 4.2 kb 
3 -fragment lack Ncol. Nde l. and Ajel restriction sites. Knowing the ATG start and 
coding sequence of the Tma Pol I gene, one can design oligonucleotides that will alter 

15 the DNA sequence at the ATG stan to include an Nco L Nde l. or Ase l restriction site 
via oligonucleotide site-directed mutagenesis. In addition, the mutagenic 
oligonucleotides can be designed such that a deletion of sequences between the lac 
promoter in the pBS 1 vector and the beginning of the Tma Pol I gene is made 
concurrent with the inclusion of an Nde l or Ase l recognition sequence at the ATG stan. 

20 The deletion of sequences between the lac promoter in the vector and stan of the 

Tma Pc gene would also eliminate the Xma l restriction site in the deleted region, thus 
making it convenient to assemble the entire coding sequence in an expression plasmid 
using conventional skill in the an (see. e.g., synthesis protocols for pDG174 - pDGlSl 
in copending Serial No. 455,967, filed December 22. 1989, incorporated herein by 

25 reference, and Example 5). 

EXAMPLE 4 

PGR With Tma DNA Polymerase 
. About 1 .25 units of the Tma DNA polymerase purified in Example 1 is usexl to 
amplify rRNA sequences from Tih genomic DNA. The reaction volume is 50 jil, and 
30 tlie reaction mixture contains 50 pmol of primer DG73, 105 to 106 copies of the Tth 
genome (-2 x 105 copies of genome/ng DNA), 50 pmol of primer DG74, 200 |iM of 
each dNTP, 2 mM MgCh. 10 mM Tris-HC!, pH 8.3. 50 mM KCl, and 100 iig/wA 
gelatin (optionally, gelatin may be omitted). 

The reaction is carried out on a Perkin-Elmer Cetus Insnnments DNA Thermal 
35 Cycler. Twenty to thiny cycles of 96'C for 15 seconds; 50'C for 30 seconds, and 

75*C for 30 seconds are cariied out. At 20 cycles, the amplification product (160 bp in 
size) can be faintiy seen on an eihidium bromide stained gel, and at 30 cycles, Lhe 
product is readily visible (under UV light) on the ethidium bromide stained gel. 
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The PGR may yield fewer non-specific products if fewer units of Jm DNA 
polymerase are used (i.e.. 0.31 units/50 nl reaction). Furthermore, the addition of a 
non-ionic detergent, such as laureth-12, to the reaction mixture to a final concentration 
of about 0.5% to 1% can improve the yield of PGR product. 
5 Primers DG73 and DG74 are shown below: 

DG73 SEQ ID NO: 14 5' TAGG iTCCGrGGGGTTGTAG 
DG74 SEQ ID NO: 1 5 5' AGGAGGTGATCCAACGGCA 

EXAMPLE 5 

Rprombin i^ni; F-xpression 
10 Vy^ninrs forT m?^ HNA Polymerase 

A. Mutagenesis o f V anH 3' Ends of the Tmn Pol I Gene 

The 5' end of the Tma gene in vector pBS:Tma7-l (ATCC No. 6847 1, later 
renamed pTmaOl) was mutagenized with oligonucleotides DG240 and DG244 via 
oligonucleotide site-directed mutagenesis. Plasmid pBS:Tma7-l consists of the 2.6 kb 
15 5' Xmai fragment cloned into vector pBS 13+. Resultant mutants from both 

mutageneses had deletions between the ATG of p-galactosidase in the pBS+ vector and 
the ATG of Tma Pol 1 so that the Tma coding sequence was positioned for expression 
utilizing the vector lac promoter, operator, and ribosome binding site (RBS). Both sets 
of mutants also had alterations in the second and sixth codons for Tma Pol I to be more 
20 compatible with the codon usage of E. soli without changing the amino acid sequence 
of the encoded protein. In addition, DG240 placed an im restriction site at die ATG 
start of tiie coding sequence (5'CATAIQ), and DG244 placed an tel restriction site 
at the ATG start of the coding sequence (5'CCAIGG). DG240 mutant candidate 
colonies were screened wiUi [732?]. labelled oligonucleotide DG241, and DG244 
25 m.utant candidate colonies were screened with lY32p]-labelled oUgonucleotide DG245. 
Plasmid DNA was isolated from colonies that hybridized wiOi xht appropriate probes, 
- and mutations were confirmed via restriction analysis and DNA sequence analysis. The 
DG240 mutant was named pTma5"Nde#3 and later renamed pTma06. The DC244 
mutant was named pTma5*Nco#9 and later renamed pTma07. 
30 The 3'-end of the Tma Pol I gene was mutagenized in pBSTma3'l 1-1 Bam/Bgl 

(ATCC No. 68472, later renamed nTma04) with mutagenic oligonucleotide DG238. 
Plasmid pBSTrna3'l 1-1 Bam/'Bgl was constructed as described in Example 3 by 
cloning tiie 4.2 kb 3' Xml fragment into pBS13+, digesting die resulting plasmid with 
Bamm and figin, and circularizing by ligation the large fragment from die digestion. 
35 DG238 insens EcoR V and BamH I sites immediately downstream of Uie TGA stop 
codon. Mutant colony candidates were identified with [ySiPl-labelled oligonucleotide 
DG239. Plasmid DNA isolated from positive colonies was screened for appropriate 
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restriction digest patterns, and the DNA sequence was confirmed. One correct plasmid 
obtained was designated as pTma3'mut#l and later renamed pTmaOS. 

B. Assembling the Full-Length Gene in a lac Promoter Vector 

For purposes of studying low level expression of Tma Pol I in £. coli and 
5 possible complementation of E. coH polymerase mutants by Tma Pol I (where high 
level expression might kill the cell, but where low level expression might rescue or 
complement), the Tma Pol I gene was assembled in the pBS13+ cloning vector. An 
--300 bp Xma l to EcoR V fragment from pTma3*mut#l was isolated and purified, 
following agarose gel electrophoresis and ethidium bromide staining, by excising an 

10 agarose gel slice containing the -300 bp fragment and fre< ding in a Costar spinex filter 
unit. Upon thawing, the unit was spun in a microfuge, and the liquid containing the 
DNA fragment was collected. After ethanol precipitation, the fi^gment was ligated with 
each of the two 5'-mutated vectors, pTma5'Nde#3 and pTma5*Nco#9, which had each 
been digested with Asg? 18. repaired with Klenow and all 4 dNTPs (the reaction 

15 conditions are 56 mM Tris-CI. pH 8.0. 56 mM NaCI. 6 mM MgCb. 6 mM DTT, ^ .M 
dNTPs. and 1 1 units of Klenow at 37'C for 15 minutes; then inactivate at 75'C for 10 
minutes), and then further digested with Xma l. 

The ligation was carried out in two steps. To ligate the Xma l sticky ends, the 
conditions were 20 ^ig/ml total DNA, 20 mM Tris-Cl, pH 7.4, 50 mM NaCl, 10 mM 

20 MgCl2, 40 ^iM ATP, and 0.2 Weiss units T4 DNA ligase per 20 |il reaction at O'C 
overnight. To ligate Asc718-digesied. Klenow repaired blunt ends wiUi EcoR V- 
digested blunt ends, tiie first ligations are diluted 4 to 5 fold and incubated at 15*C in 
Uie same ligation buffer, except 1 mM ATP and 10 Weiss units of T4 DNA ligase are 
used per 20 |il reaction. Ligations were transformed into DGlOl host cells. 

25 Candidates were screened for appropriate restriction sites, and the DNA sequences 
around the cloning sites was confirmed. The desired plasmids were designated 
pTmaOS (Ndsl site at ATG) and pTma09 (Nco l site at ATG). 

C. Assembling the Full-Len gth Gene in P i ^Expression Vectors 

The following table describes Pl promoter expression vectors used for 
30 assembling and expressing full-lengtii Tma Pol I under the control of X Pl promoter. 

Oligonucleotide Duplexes 
Vggtpr Site at ATG RBS* Asufl+A** Cloned into pTXjl60 or pDG161 Amp/Tet^** 



pDG174 ii^l T7 - r)G106/DG107 Anip 

35 pDG178 im N - CKjllO/DGlll Amp. 

pDG182 fcJ£Ql T7 + FL42/FL43 Amp 

pDG184 tiCQl N + FL44/FL45 Amp 

pIXjl85 NcQl N + FL44/FL45 Tei 
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*RBS - Phage T7 gene 10 or lambda gene N ribosome bind site. 

** AsuH sites destroyed by digestion with £se45I. repair with Klenow, and ligation 

of the repaired ends. . 
*** Antibiouc resistance determinant ampicilhn or tetracycline. 

5 The five vectors in the table are derivatives of plasmid pDG160. if ampicUUn resistant, 
or pDGl 61, if tetracycline re 3is-^t. PlasmidspDGl60andpDG161 and the scheme 
for constructing vectors similar to the pDG vectors shown in the table ar- described in 
Serial No. 455,967, filed December 22. 1989, incoiporated herein by reference. TTie 
vectors confer ampidllin or tetracycline resistance and all contain the 5-toxin positive 

10 retroregulator from Bacillus thnringiensis and the same point mutations in the RNA n 
gene that render the plasmids temperature sensitive for copy number. 

The probes and oligonucleotides described in the Table are shown below. 

DG240 SEQ ID NO: 16 

5'CCATCAAAAAG/-/vATAGTCTAGCCATATGTGTrTCCTGTGTGAAATrG 
15 DG241 SEQ ID NO: 17 5'AAACACATATGGCTAGAC 
DG244 SEQ ID NO: 18 

5-CCATCAAAAAGA.\ATAGTCTACCCATGGTrGTrrCCTGTGTGAAATrG 

DG245 SEQ ID NO: 19 5'AAACAACCATGGCTAGAC 

DG238 SEQ ID NO: 20 
20 5'GCAAA.^CATGGTCGTGATATCGGATCCGGAGGTGTTATCTGTGG 

DG239 SEQ ID NO: 21 S CCGAVATCACGACCATG 

DG106 SEQ ID NO: 22 S'CCGGAAGAAGGAGATATACATATGAGCT 

DG107 SEQ ID NO: 23 5'CATATGTATATCTCCTrCTr 

DGllO SEQ ID NO: 24 5 CCGGAGGAGAAAACATATGAGCT 

25 DGlll SEQ ID NC. 25 SCATATGTnTCTCCT 

FL42 SEQ ID NO: 26 5'CCGGAAGAAGGAGAAAATACCATGGGCCCGGTAC 

FL43 SEQ ID NO: 27 S CGGGCCCATGGTATnTCTCCTTCTT 

FL44 SEQ ID NO: 28 5CCGGAGGAGAAAATCCATGGGCCCGGTAC 

FL45 SEQ ID NO: 29 S'CGGGCCCATGGATnTCTCCT 
30 A three-fragment ligation was used to assemble the Tma Pol I gene in the 

vectors. The vectors are digested with Smal and either Ndfil (pDG174. pDG178) or 
Ncol (pDG182, pDG184, pDG185). The 5' end of the Tim Pol I gene is from 
pTma5'Nde#3 digested with Ndel and Xffial or pTma5'Nco#9 digested with Ncol and 
Xmal. The 3" end of the gene is from pTma3'mut#l digested with Xm&l and EeoRV 
35 and the -300 bp fragment purified as described above. 

The plasmid pDG182 shown in the Table and the scheme above were used to 
construct expression ■• xtor pTmal3. The plasmid pDGl84 and the scheme above 
were used to construct expression vectors pTmal2-l and pTmal2-3. Plasmid 
pTmal2-3 differs from pTmal2-l in that pTmal2-3 is a dimer of pTm.a12-l produced 
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during the same ligaoon/transformarion protocol. The plasmid pDG185 and the scheme 
shown above were used to construct expression vector pTMal 1 . 

Even though a veaor may contain the entire polymerase coding sequence, a 
shortened form of the enzyme can be expressed either exclusively or in combination 

5 wdth a full length polymerase. These shortened forms of Tma DNA polymerase result 
from translation initiation occurring at one of the methionine (ATG) codons in the 
coding sequence other than the 5'-ATG. Tlie monomeric pTmal2-l plasmid produces, 
upon heat induction, predominantiy a biologically active thermostable DNA polymerase 
lacking amino acids 1 through 139 of native Tma DNA polymerase. This 

10 approximately 86 kDa protein is the result of translation initiation at the methionine 
codon at position 140 of the Tma coding sequence and is called MET140. 

In shake flask studies under the appropriate conditions (heat induction at 34*C 
or 36*C, but not 38*C), the multimeric pTmal2-3 expression vector yielded a 
significant level of "full length" Tma DNA polymerase (approximately 97 kDa by SDS- 

15 PAGE) and a smaller amount of the shonened (approximately 86 kDa) form resulting 
from translation initiation at Met 140. Aniino-acid sequencing of the full length Tma 
DNA polymerase indicated that the amino-terminal methionine was removed and the 
second-position alanine was present at Ihe N-terminus. 

Recombinani Tma DNA Polymerase was purified from E. cqH strain DGl 16 

20 containing plasmid pTmal2-3. The seed flask for a 10 L fermentation contained 
tryptone (20 g/1), yeast extract (10 g/1), NaCl (10 g/1), ampicillin (loO mg/1), and 
thiamine (10 mg/1). The seed flask was innocalated with a colony from an agar plate (a 
frozen glycerol culture can be used). The seed flask was grown at 30*C to between 0.5 
to 2.0 O.D. (Afiso)- The volume of seed culture inoculated into tiie fermentor is 

25 calculated such tha^ the bacterial concentration is 0.5 mg dry weight/liter. The 12.5 liter 
growth medium contained 60 mM K2HPO4, 16 mM NaNIlJflPO^, 10 mM citric acid, 
and 1 mM MgS04. The following sterile components were added: 2 g/1 glucose, 10 
mg/1 tliiamine, 2.5 g/1 casamino acids, 100 mg/1 ampicillin, and 100 mg/1 methicilUn. 
Foaming was controlled by the addition of propylene glycol as necessary, as an 

30 antifoaming agent. Airflow v/as maintained at 2 1/min. 

The fermentor was inoculated as described above, and the culture was grown at 
30*C for 4.5 hours to a cell density (Aego) of 0.7. The growth temperature was shifted 
to 35*C to induce the synthesis of recombinant Tma DNA polymerase. The temperature 
shift increases the copy number of the pTmal2-3 plasmid and simultaneously 

35 derepresses the lambda Pl promoter controlling transcription of the modified Tma DNA 
polymerase gene through inactivation of the temperature-sensitive cl repressor encoded 
by the defective prophage lysogen in the host. The cells were grown for 21 hours to an 
optical density of 4 (A680) and harvested by centrifugation. The resultLng ceU nar.te was 
stored at -70*C. 
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Recombinant Tma DNA polymerase is purified as in Example 6, below. 
Brieny, cells are thawed in 1 volume of TE buffer (50 mM Tris-Cl, pH 7.5, and 1.0 
mM EDTA with ImM DTT), and protease inliibitors are added (PMSF to 2.4 mM, 
leupeptin to 1 Hg/ml. and U-CK to 0.2 mM). The cells are lysed in an Aminco ftench 
5 pressure ceU at 20,000 psi and sonicated to reduce viscosity. The sonicate is diluted 
with TE buffer and protease inhibitors to 5.5 X wet weight cell mass (Fraction I), 
adjusted to 0.3 M ammonium sulfate, and brought rapidly to 75*C and maintained at 
75*C for 15 min. The heat-treated supernatant is chUled rapidly to 0*C, and the E. coH 
cell membranes and dentaured proteins are removed following centrifugation at 20,000 
10 X G for 30 min. The supernatant containing Ima DNA polymerase (Fraction U) is 
saved. The level of Polymin P necessary to precipitate >95% of Uie nucleic acids is 
determined by trial precipitation (usually in the range of 0.6 to 1% w/v). The desired 
amount of Polymin P is added slowly with rapid stirring at O'C for 30 min. and the 
suspension centrifuged at 20,000 X G for 30 min. to remove the precipiuted nucleic 
1 5 acids. The supernatant (Fraction UI) containing the Jm DNA polymerase is saved. 

Fraction HI is applied to a phenyl separose column that has been equUibrated in 
50 miM Tris-Cl, pH 7.5, 0.3 M ammonium sulfate, 10 mM EDTA, and 1 mM DTT. 
The column is washed with 2 to 4 column volumes of the same buffer (A28O to 
baseline), and Uien 1 to 2 column volumes of TE buffer containing 100 mM KCl to 
20 remove most contaminating E. QQh proteins. Ima DNA polymerase is then cluted from 
die column with buffer containing 50 mM Tris-Cl, pH 7.5, 2 M urea, 20% (w/v) 
eUiylene glycol, 10 mM EDTA. and 1 mM DTT, and fractions containing DNA 
polymerase activity are pooled (Fraction IV). 

Final purification of recombinant Tma DNA polymerase is achieved u.«;ing 
25 heparin sepharose chromatography (as for native or MET284 recombinant DNA 
polymerase), anion exchange chromatography, or affigel blue chromatography. 
Recombinant Tma DNA polymerase may be diafiltered into 2.5X storage buffer, 
combined with 1.5 volumes of sterile 80% (w/v) glycerol, and stored at -20*C. 

Example 6 

30 Ex pression of a Truncated Tma Polvmerase MET284 

As noted above, expression plasmids containing the complete Tma gene coding 
sequence expressed either a full lengtii polymerase resulting from translation initiation 
at the Stan codon or a shonened polymerase resulting from translation initiation 
occurring at the methionine codon at position 140. A tiiird methionine codon tiiat can 

35 act as a translation initiation site occurs at position 284 of tiie Ima gene coding 

sequence. Plasmids that express a DNA polymerase lacking amino acids 1 tiirough 283 
of native Ima DNA polymerase were constructed by introducing deleting 
corresponding regions of Uie Tma coding sequence. 
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Plasmid pTmal2-l was digested with BspHI (nucleotide position 848) and 
Hindin (nucleotide position 2629). A 1781 base pair fragment was isolated by agaix>se 
gel purification. To separate the agarose from tiie DNA, a gel slice containing the 
desired fragment was frozen at -20'C in a Costar spinex filter unit. After tiiawing at 
5 room temperature, the unit was spun in a microfuge. The filtrate containing the DNA 
was concentrated in a Speed Vac concentrator, and the DNA was precipitated with 
ethanol. 

The isolated fragment was cloned into plasmid pTmal2-l digested with Nco l 
and HindllL Because Nco l digestion leaves tiie same cohesive end sequence as 

10 digestion with EspHl* the 1781 base pair fragment has the same cohesive ends as the 
full length fragment excised from plasmid pTmal2-l by digestion with Nco l and 
Hind in. The ligation of the isolated fragment with the digested plasmid results in a 
fragment sv^tch and was used to create a plasmid designated pTmal4. 

Plasmid pTmalS was similarly constructed by cloning the same isolated 

15 fragment into pTmalB. As with pTmal4, pTmal5 drives expression of a polymerase 
that lacks amino acids 1 through 283 of native Tma DNA polymerase; translation 
initiates at the methionine codon at position 284 of the native coding sequence. 

Both the pTmal4 and pTmalS expression plasmids expressed at a high level a 
biologically active thermostable DNA polymerase of molecular weight of about 70 kDa; 

20 plasmid pTmalS expressed polymerase at a higher level than did pTmal4. Based on 
similarities with E. cqH Pol 1 Klenow fragment, such as conservation of amino acid 
sequence motifs in all three domains that are critical for 3'-5' exonuclease activity, 
distance from the amino terminus to the first domain critical for exonuclease activity, 
and length of the expressed protein, the shortened form (MET284) of Tnia polymerase 

25 should possess 3'-5' exonuclease and proof-reading activity but lack 5'-3* exonuclease 
activity. However, initial SDS activity gel assays and solution assays for 3'-5' 
exonuclease activity suggested significant attenuation in the proof-reading activity of the 
polymerase expressed by £. qqIx host cells harboring plasmid pTmal5. 

MET284 Tma DNA Polymerase was purified from E. coH strain DGl 16 

30 containing plasmid pTmal 5. The seed flask for a 10 L fermentation contained tryptonc 
(20 g/1), yeast extract (10 g/1), NaCl (10 g/1). glucose (10 g/1), ampicillin (50 mg/1), and 
thiamine (10 mg/1). The seed flask was innoculated with a colony from an agar plate (a 
frozen glycerol culture can be used). The seed flask was grov/n at 30*C to bcr.vccn 0.5 
to 2,0 O.D. (Afigo). The volume of seed culture inoculated into the fermenior is 

35 calculated such tiiat the bacterial concentration is 0.5 mg dry weight/liter. The 10 liter 
growth medium contained 25 rxM KH2PO4, 10 mM (NH4)2S04, 4 mM.sodium citrate, 
0.4 mM FeCl3, 0.04 mM ZnCb. 0,03 mM C0CI2. 0.03 mM CuCb, and 0,03 mM 
H3BO3. The following sterile components were added; 4 mM MgS04, 20 g/i glucose, 
20 mg/1 thiamine, and 50 mg/1 ampicillin. The pH was adjusted to 6.8 w^ih NaOH and 
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controUcd during the fermentation by added NH4OH. Glucose was continually added 
by coupUng to NH4OH addition. Foaming was controlled by the addition of propylene 
glycol as necessary, as an antifoaming agent. Dissolved oxygen concentration was 
maintained at 40%. 

5 The fermentor was inoculated as described above, and the culture was grown at 
30*C to a cell density of 0.5 to 1.0 X IQio cells/ml (optical density [Aeso] of 15). The 
growth temperature was shifted to 38*C to induce the synthesis of MET284 Tma DNA 
polymerase. The temperature shift increases tiie copy number of the pTmalS plasmid 
and simultaneously dereprcsses the lambda Pl promoter controlling transcription of the 
10 modified Ima DNA polymerase gene through inactivation of the tempera ture-sensirive 
cl repressor encoded by the defective prophage lysogen in the host 

The cells were grown for 6 hours to an optical density of 37 (Aeso) and 
harvested by centrifugation. The cell mass (ca. 95 g/1) was resuspcnded in an 
equivalent volume of buffer containing 50 mM Tris-Cl, pH 7.6, 20 mM EDTA and 
15 20% (w/v) glycerol. The suspension was slowly dripped into liquid nitrogen to freeze 
the suspension as "beads" or small pellets. The frozen ceUs were stored at -70'C. 

To 200 g of frozen beads (containing 100 g wet weight cell) were added 100 ml 
of IX TE (50 mM Tris-Cl. pH 7.5, 10 mM EDTA) and DTT to 0.3 mM. PMSF to 2.4 
mM, leupeptin to 1 |ig/ml and "ELCK (a protease inhibitor) to 0.2 mM. The sample 
20 was thawed on ice and uniformly resuspended ia a blender at low speed. The cell 
suspension was lysed in an Aminco french pressure cell at 20,000 psi. To reduce 
viscosity, the lysed cell sample was sonicated 4 times for 3 min. each at 50% duty cycle 
and 70% output. The sonicate was adjusted to 550 ml with IX TE containing 1 mM 
DTT, 2.4 mM PMSF, 1 |ig/ml leupeptin and 0.2 rnM TLCK (Fraction I). After 
25 addition of ammonium sulfate to 0.3 M, die crude lysate was rapidly brought to 75'C in 
a boiling water batii and transferred to a 75*C water bath for 15 min. to denature and 
inactivate E. csli host proteins. The heat-treated sample was chilled rapidly to 0*C and 
incubated on ice for 20 min. Precipitated proteins and cell membranes were removed 
by centrifugation at 20,000 X G for 30 min. at 5'C and the supernatant (Fraction H) 
30 saved. 

The heat-treated supernatant (Fraction U) was treated witii polyetiiyleneimine 
(PEI) to remove most of the DNA and RNA. Polymin P (34.96 ml of 10% [w/v], pH 
7.5) was slowly added 10 437 ml of Fraction II at O'C while stirring rapidly. After 30 
min. at O'C, the sample was centrifuged at 20,000 X G for 30 min. The supernatant 
35 (Fraction 10) was appUed at 80 ml/hr to a 100 ml phenylseparose column (3.2x12.5 
cm) that had been equilibrated in 50 mM Tris-Cl, pH 7.5. 0.3 M ammonium sulfate, 10 
mM EDTA, and 1 mM DTT. The column was washed with about 200 ml of the same 
buffer (A28O to baseline) and then witii 150 ml of 50 mM Tris-Cl, pK 7.5, 100 mf4 
KCl. 10 mM EDTA and 1 mM DTT. The MET284 Irna DNA polymerase was then 
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eluted from the column with buffer containing 50 n:iM Tris-Cl, pH 7,5, 2 M urea. 20% 
(w/v) ethylene glycol, 10 mM EDTA, and 1 mM DTT, and ftactions containing DNA 
polymerase activity were pooled (Fraction IV). 

Fraction IV is adjusted to a conductivity equivalent to 50 mM KCl in 50 mM 
5 Tris-CI, pH 7.5, 1 mM EDTA, and 1 mM DTT. The sample was applied (at 9 ml/hr) to 
a 15 ml heparin-sepharose colunrm that had been equilibrated in the same buffer. The 
colunin was washed with the same buffer at ca. 14 ral/hr (3.5 column volumes) and 
eluted with a 150 ml 0.05 to 0.5 M KCl gradient in the same buffer. The DNA 
polymerase activity eluted between 0.1 1-0.22 M KCL Fractions containing the 

10 pTmal5 encoded modifed Ima DNA polymerase are pooled, concentrated, and 

diafiltered against 2.5X storage buffer (50 xrM Tris-Cl, pH 8.0, 250 mM KCl, 0.25 
mM EDTA, 2.5 mM DTT, and 0.5% Tween 20), subsequendy mbced with 1.5 
volumes of sterile 80% (w/v) glycerol, and stored at -20*C. Optionally, the heparin 
sepharose-eluted DNA polymerase or the phenyl sepharose-^luted DNA polymerase 

15 can be dialyzed or adjusted to a conductivity equivalent to 50 mM KCl in 50 mM 
Tris-Cl, pH 7.5. 1 mM DTT, 1 mM EDTA, and 0.2% Tween 20 and appUed (1 mg 
protein/nil resin) to an affigel blue column that has been equilibrated in the same buffer. 
The column is washed with three to five column volumes of the same buffer and eluted 
with a 10 column volume KCl gradient (0.05 lo 0.8 M) in the same buffer. Fractions 

20 containing DNA polymerase activity (eluting between 0.25 and 0.4 M KCl) are pooled, 
concentrated, diafiltered, and stored as above. 

The relative tiiermoresistance of various DNA polymerases has been compared. 
At 97.5'C the half-life of native Tma DNA polymerase is more tiian twice the half-life 
of eitiier native or recom.binant Tag DNA (i.e., AmpliTaq<5>) DNA polymerase. 

25 Surprisingly, the half-life at 97,5'C of MET284 Tnia DNA polymerase is 2.5 to 3 times 
longer than the half-life of native Tm^ DNA polymerase. 

PCR tubes containing 10 mM Tris-Cl. pH 8.3. and 1.5 mM MgCl2 (for Tag or 
native Ima DNA polymerase) or 3 mM Mga2 (for MET284 Ima DNA polymerase). 
50 mM KCl (for lafl, native Ima and MET284 Ima DNA polymerases) or no KCl (for 

30 MET284 Trna DNA polymerase). 0.5 \iM each of primers PCROl and PCR02. 1 ng of 
lambda template DNA, 200 ^xM of each dNTP except dCTP, and 4 units of each 
enzyme were incubated at 97.5'C in a large water bath for times ranging from 0 to 60 
min. Samples were withdrawn with time, stored at 0*C, and 5 \i\ assayed at 75*C for 
10 min. in a standard activity assay for residual activity. 

35 JsQ DNA polymerase had a half-life of about 10 min. at 97.5*C, while native 

Tma DNA polymerase had a half-iifc of about 21 to 22 min. at 97.5'C. Surprisingly, 
the MET284 form of Tnia DNA polymerase had a significanlty longer half-life (50 to 
55 min.) than either Tag or native Tma DN.A polymerase. The improved 
thermoresistance of MET284 Tma DNA polymerase wUl find applications in PCR, 
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particularly where G+C-rich targets are difficult to amplify because the 
strand-separation temperature required for complete denaiuration of target and PGR 
product sequences leads to enzyme inactivation. 

PGR tubes containing 50 ^il of 10 mM Tris-Gl. pH 8.3, 3 mM MgCb. 200 
5 of each dNTP, 0.5 ng bacteriophage lambda DNA, 0.5 nM of primer PGROl , 4 units 
of MET284 Trm DNA polymerase, and 0.5 ^iM of primer PGR02 or PLIO were cycled 
for 25 cycles using Tden of 96*G for 1 min. and T<u„eai.«iend of 60'C for 2 min. 
Lambda DNA template, deoxynucleotide stock solutions, and primers PGROl and 
PGR02 were part of the PECIGeneAmp® kit Primer PLIO has the sequence: (SEQ 
10 ID NO. 45) 5'-GGGGTAGGnTGTCTCACGGGCAAC-3' and is complementary to 
bacteriophage lambda nucleotides 8106-8130. 

The primers PGROl and PGR02 ampUfy a 500 bp product from lambda. The 
primer pair PCROl and PLIO amplify a 1 kb product from lambda. After amplification 
with the respective primer sets. 5 ^1 aliquois were subjected to agarose gel 
1 5 electrophoresis and the specific intended product bands visualized with ethidium 

bromide staining. Abundant levels of product were generated with both primer sets, 
showing LHat MET284 Ima DNA polymerase successfully amplified the intended target 
sequence. 

Example 7 

20 p.x pression of Truncated Tma Polymerasg 

As noted above, host ells transformed with plasmids that contain the complete 
Tma DNA polymerase gene coding sequence express a shortened fonn (MET140) of 
Tma polymerase either exclusively or along with the fuU length polymerase. Mutations 
can be m.ade to control which form of the polymerase is expressed. To enhance the 
25 exclusive expression of the MET140 form of the polymerase, the coding region 

corresponding to amino acids through 139 were deleted from the expression vector. 
The protocol for constnicting such a deletion is similar to the constniction described in 
Example 6: a shortened gene fragment is excised and then reinserted into a vector from 
which a full length fragment has been excised. However, the shortened fragment can 
30 be obtained as a PGR amplification product rather than purified from a restriction 

digest. This methodology allows a new upstream restriction site (or other sequences) 
to be incorporated where useful. 

To delete the region up to the methionine codon at position 140, an SpllI site 
was introduced into pTmal2-l and pTmal3 using PGR. A forward primer (FL63) was 
35 designed to introduce the S^hl site just upstream of the methionine codon at position 
140. The reverse primer (FL69) was chosen to include an Xbal at position 624. 
Plasmid pTmal2-l hnearized with Smal was used as tiie PGR template, yielding a 225 
bp PGR product. 
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Before digestion, the PGR product was treated with 50 (ig/ml of Proteinase K 
in PGR reaction mix plus 0.5% SDS and 5 mM EDTA. After incubating for 30 
minutes at 37*C, the Proteinase K was heat inactivated at 68*G for 10 minutes. This 
procedure eliminated any la^ polymerase bound to the product that could inhibit 

5 subsequent restricrion digests. The buffer was changed to a TE buffer, and the excess 
PGR primers were removed with a Centricon 100 microconcentrator. 

The amplified fragment was digested with Sphl . then treated with Klenow to 
create a blunt end at the Sph l-cleaved end, and finally digested with Xbal . The 
resulting fragment was ligaied with plasmid pTmal3 (pTmal2-l would have been 

10 suitable) that had been digested with NcoL repaired with Klenow, and then digested 
with 2QfflI- The ligation yielded an in-frame coding sequence with the region between 
the initial Nco l site (upstream of the first methionine codon of the coding sequence) and 
the introduced Sph I site (upsu-eam of the methionine codon at position 140) deleted. 
The resulting expression vector was designated pTmal6. 

15 The primers used in this example are given below and in the Sequence Listing 

section. 

Primer SEP ID NO: Sequence 

FL63 SEQ ID NO: 30 5'GATAAAGGCATGCTTCAGGTTGTGAACG 

FL69 SEQ ID NO: 31 5TGTACTTCTCTAGAAGCTGAACAGCAG 

20 ' Example 8 

Elimination of Undcsired RBS in MET! 40 Expression Vectors 
Reduced expression of the MET140 form of Tma DNA polymerase can be 
achieved by eliminating the ribosome binding site (RBS) upstream of the methionine 
codon at position 140. The RBS was be eliminated via oligonucleotide site-directed 

25 mutagenesis without changing the amino acid sequence. Taking advantage of the 

redundancy of the genetic code, one can make changes in the third position of codons 
to alter the nucleic acid sequence, thereby eliminating the RBS. without changing the 
amino acid sequence of the encoded protein. 

A mutagenic primer (FL64) containing the modified sequence was synthesized 

30 and phosphorylated. Single-stranded pTma09 (a full length clone having an Nco l site) 
was prepared by coinfecting with the helper phage R408, connmercially available from 
Straiagene. A *'ganped duplex" of single stranded pTma09 and the large fragment from 
the Pvu II digestion of pBS 1 3+ was created by mixing the two plasmids, heating to 
boiling for 2 minutes, and cooling to 65*G for 5 minutes. The phosphorylated primer 

35 was then annealed with the "gapped duplex" by mixing, heating to 80*C for 2 minutes, 
and then cooling slowly to room temperature. The remaining gaps were filled by 
extension with Klenow and the fragments ligated witii T4 DNA ligase, both reactions 
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taking place in 200 \lM of each dNTP and 40 ^iM ATP in standard salts at 37*C for 30 
minutes. 

The resulting circular fragment was transformed into DGlOl host cells by plate 
transformations on nitrocellulose filters. Duplicate filters were made and the presence 

5 of the correct plasmid was detected by probing with a y^^P-phosphorylated probe 
(FL65). The vector that resulted was designated pTmal9. 

The RBS minus portion from pTmal9 was cloned into pTmal2-l via an 
Ncol/Xbal fragment switch. Plasmid pTmal9 was digested with NsqI and XM, and 
the 620 bp fragment was purified by gel electrophoresis, as in Example 7, above. 

10 Plasmid pTmal2.1 was digested with Hcfil. JCiffll. and }^£rQ)i. The Xflnl cleavage 
inactivates the RBS+ fragment for the subsequent ligation step, which is done under 
conditions suitable for ligating "sticky" ends (dilute ligase and 40 ^iM ATP). Finally, 
the ligation product is transformed into DGl 16 host cells for expression and designated 
pTmal9-RBS. 

1 5 The oligonucleotide sequences used in this example are listed below and in the 

Sequence Listing section. 

Oligo SEP ID NO: S?q\igr.cg 

FL64 SEQ ID NO: 32 5'CTGAAGCATGTCTTTGTCACCGGTTACTATGAATAT 
FL65 SEQ ID NO: 33 5'TAGTAACCGGTGACAAAG 

20 Example 9 

pvpr^ceinp nf TTTp^pteH Tma DNA Polvmerase MET ASP21 
To effect translation initiation at about the aspartic acid codon at position 21 of 
the Tma DNA polymerase gene coding sequence, a methionine codon is introduced 
before the codon, and the region from the initial Ncol site to this introduced methionine 
25 codon is deleted. The deletion process involves PGR with the same downstream 
primer described above (FL69) and with an upstream primer (FL66) designed to 
incorporate an Nco l site and a methionine codon to yield a 570 base pair product. 

The amplified product is concentrated with a Ccntricon-100 microconcentraior 
to eliminate excess primers and buffer. The product is concentrated in a Speed Vac 
30 concentrator and then resuspended in the digestion mix. Tne amplified product is 
digested with Ncol and 2<bal. Likewise. pTmal2-l, pTmal3, or pTmal9-RBS is 
digested with the same r;,'0 restriction enzymes, and the digested, ampUfied fragment is 
Ugated with the digested expression vector. The resulting construct has a deletion from 
the N£2l site upstream of the start codon of the native Jjm coding sequence to the new 
35 methionine codon introduced upstream of the aspartic acid codon at position 21 of the 
native Tma coding sequence. 

Similarly, a deletion mutant can be created such that translation initiation begins 
at Glu74, the glutamic acid codon at position 74 of the native Tnia coding sequence. 
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An upstream primer (FL67) is designed to introduce a methionine nodon and an Ncol 
site before Glu74. The downstream primer and cloning protocol used are as described 
above for the MET-ASP21 construct 

The upstream primer sequences used in this example are listed below and in the 
5 Sequence Listing section. 

Oligo SEP ID NO: Sequence 

FL66 SEQ ID NO: 34 5 'CTATGCC ATGG ATAG ATCGCTTTCTACTTCC 

FL67 SEQ ID NO: 35 S^CAAGCCCATGGAAACTTACAAGGCTCAAAGA 

Example 10 

10 Expression Vectors With T7 Promoters 

Expression efficiency can be altered by chan^ :g the promoter and/or ribosomal 
binding site (RBS) in an expression vector. The T7 cenelO promoter and RBS have 
been used to drive expression of Tma DNA polymerase from expression vector 
pTmal7, and the T7 GenelO promoter and the Gene N RBS have been used to drive 

15 expression of Ima DNA polymerase from expression vector pTmalS. The 

construction of these vectors took advantage of unique restriction sites present in 
pTmal2-l: an A fill site upstream of the pronaoter, an Ncol site downstream of the 
RBS, and a BspE I site between the promoter and the RBS. The existing promoter was 
. excised from pTmal2-l and replaced with a synthetic T7 Gene 10 promoter using 

20 techniques similar to those described in the previous example^. 

The synthetic insert was created from two overlapping synthetic 
oligonucleotides. To create pTmal7 (with T7 Gene 10 RBS). equal portions of FR414 
and FR416 were mixed, heated to boiling, and cooled slowly to room temperature. 
The hybridized oligonucleotides were extended with Klenow to -^ate a full length 

25 double stranded insen. The extended fragment was then digested witii AQII and Nco l. 
leaving the appropriate **sticky" ends. The insen was cloned into plasmid pTmal2-l 
digested with Afln and Nco l. DGl 16 host cells were transformed with the resulting 
plasmid and transformants screened for the desired plasmid. 

The same procedure was used in Uie creation of pTmalS (with Gene N RBS), 

30 except that FR414 and FR41 8 were used, and the extended fragment was digested with 
Afln and BspEL This DNA fragment was substituted for the Pl promoter in plasmid 
pTmal2-l that had been digested with AolII and BspEI. 

Plasmids pTmal7 and pTmalS are used to transform £. qq\\ host cells that have 
been modified to contain an inducible T7 DNA polymerase gene. 

35 The oligonucleotides used in the construction of these vectors are listed below 

and in the Sequence Listing section. 
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FR414 SEQIPN0:36 

S'TCAGCTrAAGACrrCGAAATrAATACGACTCACTATAGGGAGACCACAACGGTIT. 
CCCTC 

FR416 SEQIDNO:37 
5 5 'TCG ACCATGGGTATATCTCCn-CTTA AAGTTA A ACAAAATTATTTCTAG AGGG A AACC- 

GTTG 

FR418 SEQIDNO:38 

5 'TC AGTCCGG ATAAACAAAATTATTTCTAG AGGG AAACCGTTG 

Example 1 1 

10 Trf^n^lnnO"'^^ Coupling 

As described above, transladonal coupling can increase the efficiency of 
expression of a protein by coupling a short coding sequence just upstream of the 
iniuation site of the coding sequence for the protein. Termination of translation of the 
upstream coding sequence leaves the ribosome in close proximity to the initiation site 

15 for the downstream coding sequence. The upstream coding sequence functions only to 
move the ribosome downstream to the stan of the coding sequence for the desired 
protein. 

Translationally coupled Tjna expression ve-tors were constructed with the 
translation initiation signal and Hrst ten codons of the T7 bacteriophage major capsid 
20 protein (gene 10) fused in-frame to the last six codons of E. coH TrpE placed upstream 
of the Ima coding region. The TG A (stop) codon for TipE is "coupled" with tiie ATG 
(start) codon for the Ima gene, foiming the sequence TGATG. A one base frame-shift 
is required between translation of the short coding sequence and translation of the Ima 
coding sequence. 

25 In the example below, a fragment containing Uie T7 Gene 10-E. £sli TrpE.' irpD 

fusion product (the last 6 codons ar.d TGA stop codon from TrpE along with the 
overlapping ATG start codon from TrpD) was transferred from a pre-existing plasmid. 
One of ordinary skill will recognize that the T7 Gene lO-E- COliTrpE/TrpD fusion 
product used in the constnicrion of the tnmslationally coupled expression vectors can be 
30 consmicted as a synthetic oligonucleotide. The sequence for the inserted fragment is 
listed below and in the Sequence Listing section. 

The T7 Gene 10-E. coli TrpE/TrpD fusion product was ampUfied from plasmid 
pSYC1868 with primers FL48 and FL49. With primers FL51 and FL53, the 5' end of 
the Ima Pol I gene in pTma08 (a full length clone containing an Hsifil site) was 
35 amplified from the ATG stan codon to tiie MiqI site downstream of the ATG start 

codon. The primers FL51 and FL49 were designed to leave overiapping regions such 
that the two amplified products could be annealed and extended, essentially as 
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described in Example 10. The two amplificarion products were mixed, heated to 95*C, 

slowly cooled to room temperature to anneal, and extended with Tag polymerase. 
The extended insen was amplified with primers FL48 and FL53 and then 

digested with Xma l and Mro l. Plasmid pTmal2- 1 was digested with Mro l and treated 
5 with calf intestine alkaline phosphatase to prevent re-ligation. The digested pTmal2-l 

was ligated with the insen. DGl 16 host cells were transformed with the resulting 

construa and transformants screened for the desired plasmid DNA. The resulting 

vector was designated pTma20. 

The sequences of the oligonucleotide primers and the T7 Gene 10-E. qq\\ 
10 TrpE/TrpD fusion product (Gene 10 insen) are listed below and in the Sequence Listing 

section. 

Primers SEP ID NO: Sequence 

FL48 SEQ ED NO: 39 5TCCGGACTTTAAGAAGGAGATATAC 
FL49 SEQ ID NO: 40 5'AATAGTCTAGCCATCAGAAAGTCTCCTGTGC 
15 FL51 SEQ ID NO: 41 S'AGACTITCTGATGGCTAGACTATTTCTT 
FL53 SEQ ID NO: 42 S'CTGAATCAGGAGACCCGGGGTCTTTGGTC 
Gene 1 0 insen SEQ ID NO: 

5'CnTAAGAAGGAGATATACATATGGCTAGCATGACTGGTGGACAGCAAATG 
CATGCACAGGAGACTTTCTGaTG 

20 Example 12 

Arg U tRNA Expression 
The pattern of codon usage differs between Thermotoga maritima and E. coli . 
In the Tm.a coding sequence, arginine is most frequently coded for by the "AGA" 
codon, whereas this codon is used in low frequency in E. coH host cells. The 

25 corresponding "Arg U" tRNA appears in low concentrations in E. coH. The low 
concentration in the host cell of Arg tRNA using the "AGA" codon may limit die 
translation efficiency of the Tma polymerase gene. The efficiency of translation of tiie 
Tma coding sequence within an £. qq\\ host may be improved by increasing the 
concentration of this tRNA species by cloning m.ultiple copies of the tRNA gene into 

30 the host cell using a second expression vector that contains tiie gene for the "Arg I J" 
tRNA. 

The Arg U tRNA gene was PGR amplified from E. coH, genomic DNA using 
the primers DG284 and DG285. The amplification product was digested with Sai l and 
JBamHI. The ColEI compatible vector pACYC184 was digested with Sail and Bam HI. 
35 and the Arg U gene fragment was subsequendy ligated with the digested vector. 
' DGlOl cells were transformed, and the ligated vector was designated pARGOl. 
Finally, DGl 16 host cells were co-transformed with pARGOl and pTmal2-} . 
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The oUgonuclcotide primers used in this Example are Usted below and in the 
Sequence Listing section. 

Primers SEP ID NO: Sequence 

DG284 SEQ ID NO: 43 S'CGGGGATCCAAAAGCCATTGACTCAGCAAGG 
5 DG285 SEQ ID NO: 44 5'GGGGGTCGACGCATGCGAGGAAAATAGACG 
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We Qaim : 

1 . A purified thermostable DNA polymerase I enzyme that catalyzes 
combination of nucleoside triphosphates to form a nucleic acid strand complementary to 
a nucleic acid template strand, said enzyme derived from the eubacterium Thcrmotoga 

5 maririma^ 



2. The enzyme of claim 1 having a molecular weight between about 97 
kilodaltons and 103 kilodaltons. 

10 3. The enzyme of claim 1 that has reverse transcriptase activity. 

4. The enzyme of claim 1 that has 3*->5' exonuclease activity. 

5. The enzyme of claim 1 that is in native form. 

6. The enzyme of claim 2 that is in native form. 

7 . The enzyme of claim 1 that is in recombinant form. 



20 8. The enzyme of claim 2 that is in recombinant form. 

9. The enzyme of claim 2 that has reverse transcriptase and 3*— >5' 
exonuclease activity. 

25 10. The enzyme of claim 9 that is in native form. 



1 1 . The enzyme of claim 9 that is in recombinant form. 

1 2. A method for purifying Themotog a maririma DNA polymerase from X 
30 maritima cells, said method comprising the steps of: 

(a) preparing a crude cell extract from said cells; 

(b) adjusting the ionic strength of said extract so that said polymerase 
dissociates from any nucleic acid in said extract; 

(c) subjecting the extract to hydrophobic interaction chromatography; 

35 (d) subjecting the extract to DNA binding protein affinity chromatography; 

(e) subjecting the extract to nucleotide binding protein affinity 
chromatography; and 

(f) subjecting the extract to chromatography selected from the group 
consisting of anion exchange, cation exchange, and hydroxyapatite chromatography. 
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1 3. A recombinant DNA sequence that encodes Thermotopa m^ritiin^ DNA 
polymerase I activity. 



5 




14. The DNA sequence of claim 13 that encodes the amino acid sequence of 




SEQ ID NO: 1 , from amino to carboxy temiinus: 








1 


MARLFLFDGT 


ALAYRAYYAL 


DRSLSTSTGI 


PTNATYGVAR 


MLVRFIKDHI 




51 


IVGKDYVAVA 


FDKKAATFRH 


KLLETYKAQR 


PKTPDLLIQQ 


LPYIKKLVEA 




101 


LGMKVLEVEG 


YEPJJDIIATL 


AVKGLPLFDE 


IFIVTGDKDM 


LQLVNEKIKV 


10 


151 


WRIVKGISDL 


ELYDAQKVKE 


KYGVEPQQIP 


DLLALTGDEI 


DNIPGVTGIG 




201 


EKTAVQLLEK 


YKDLEDILNH 


VRELPQKVRK 


ALLRDRENAI 


LSKKLAILET 




251 


NVPIEINWEE 


LRYQGYDREK 


LLPLLKELEF 


ASIMKELQLY 


EESEPVGYRI 




301 


VKDLVEFEKL. 


lEKLRESPSF 


AIDLETSSLD 


PFDCDIVGIS 


VSFKPKEAYY 




351 


IPLHHRNAQN 


LDEKEVLKKL 


KEILEDPGAK 


IVGQNLKFDY 


KVLMVKGVEP 


15 


401 


VPPYFDTMIA 


AYLLEPNEKK 


FNLDDLALKF 


LGYKMTSYQE 


LMSFSFPLFG 




451 


FSFADVPVEK 


AAIJYSCEDAD 


ITYRLYKTLS 


LKLHEADLEN 


VFYKIEMPLV 




501 


NVLARMELNG 


VYVDTEFLKK 


LSEEYGKKLE 


ELAEEIYRIA 


GEPFNINSPK 




551 


QVSRILFEKL 


GIKPRGKTTK 


TGDYSTRIEV 


LEELAGEHEI 


IPLILEYRKI 




601 


QKLKSTYIDA 


LPKMVNPKTG 


RIHASFNQTG 


TATGRLSSSD 


PNLQNLPTKS 


20 


651 


EEGKEIRKAI 


VPQDPNWWIV 


SADYSQIELR 


ILAHLSGDEN 


LLRAFEEGID 




701 


VHTLTASRIF 


NVKPEEVTEE 


MRRAGKtWNF 


SIIYGVTPYG 


LSVRLGVPVK 




751 


EAEKMIVNYF 


VLYPKVRDYI 


QRWSEAKEK 


GYVRTLFGRK 


RDIPQLMARD 




801 


RNTQAEGERI 


AIMTPIQGTA 


ADIIKLAMIE 


IDRELKERKM 


RSKMIIQVHD 




851 


ELVFEVPNEE 


KDALVELVKD 


R1"1TNVVKLSV 


PLEVDVTIGK 


TWS 



25 

» 

15, The DNA sequence of claim 14 that is the DNA sequence of SEQ ID 

NO: 1: 

1 ATGGCGAGAC TATTTCTCTT TGATGGAACT GCTCTGGCCT ACAGAGCGTA 

51 CTATGCGCTC GATAGATCGC TTTCTACTTC CACCGGCATT "CCCACAAACG 

30 101 CCACATACGG TGTGGCGAGG ATGCTGGTGA GATTCATCAA AGACCATATC 

151 ATTGTCGGAA AAGACTACGT TGCTGTGGCT TTCGACAAAA AAGCTGCCAC 

201 CTTCAGACAC AAGCTCCTCG AGACTTACAA GGCTCAAAGA CCAAAGACTC 

251 CGGATCTCCT GATTCAGCAG CTTCCGTACA TAAAGAAGCT GGTCGAAGCC 

301 CTTGGAATGA AAGTGCTGGA GGTAGAAGGA TACGAAGCGG ACGATATAAT 

35 351 TGCCACTCTG GCTGTGAAGG GGCTTCCGCT TTTTGATGAA ATATTCATAG 

401 TGACCGGAGA TAAAGACATG CTTCAGCTTG TGAACGAAAA GATCAAGGTG 

451 TGGCGAATCG TAAAAGGGAT ATCCGATCTG GAACTTTACG ATGCGCAGAA 

501 GGTGAAGGAA AAA.TACGGTG TTGAACCCCA GCAGATCCCG GATCTTCTGG 

551 CTCTAACCGG AGATGAAATA GACAACATCC CCGGTGTAAC TGGGATAGGT 
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10 



15 



20 



25 



30 



35 



601 


GAAAAGACTG 


CTGTTCAGCT 


TCTAGAGAAG 


TACAAAGACC 


TCGAAGACAT 


651 


ACTGAATCAT 


GTTCGCGAAC 


TTCCTCAAAA 


GGTGAGAAAA 


^^^^ ^^Ml ^^^^ Ml Ml 

GCCCTGCTTC 


701 


GAGACAGAGA 


AAACGCCATT 


CTCAGCAAAA 


AGCTGGCGAT 


TCTGGAAACA 


751 


AACGTTCCCA 


TTGAAATAAA 


CTGGGAAGAA 


CTTCGCTACC 


AGGGCTACGA 


801 


CAGAGAGAAA 


CTCTTACCAC 


TTTTGAAAGA 


ACTGGAATTC 


GCATCCATCA 


851 


TGAAGGAACT 


TCAACTGTAC 


GAAGAGTCCG 


AACCCGTTGG 


AiACAGAATA 


901 


GTGAAAGACC 


TAGTGGAATT 


TGAAAAACTC 


ATAGAGAAAC 


TGAGAGAATC 


951 


CCCTTCGTTC 


GCCATAGATC 


TTGAGACGTC 


TTCCCTCGAT 


CCTTTCGACT 


1001 


GCGACATTGT 


CGGTATCTCT 


GTGTCTTTCA 


Vk ^^^^ % K % Ik 

AACCAAAGGA 


AGCGTACTAC 


^ ft p ^ 
1051 


ATACCACTCC 


ATCATAGAAA 


CGCCCAGAAC 


CTGGACGAAA 


^ ^ ^ Ml Ml Ml 

AAGAGGTTCT 


1101 


GAAAAAGCTC 


AAAGAAATTC 


TGGAGGACCC 


CGGAGCAAAG 


V Ml ^^^^ Ml Ml PWl 

ATCGTTGGTC 


1151 


AGAATTTGAA 


ATTCGATTAC 


AAGGTGTTGA 


TGGTGAAGGG 


TGTTGAACCT 


1201 


GTTCCTCCTT 


ACTTCGACAC 


GATGATAGCG 


GCTTACCTTC 


TTGAGCCGAA 


1251 


CGAAAAGAAG 


TTCAATCTGG 


ACGATCTCGC 


ATTGAAATTT 


CTTGGATACA 


1301 


AAATGACATC 


TTACCAAGAG 


CTCATGTCCT 


TCTCTTTTCC 


GCTGTTTGGT 


1351 


TTCAGTTTTG 


CCGATGTTCC 


TGTAGAAAAA 


GCAGCGAACT 


ACTCCTGTGA 


1401 


AGATGCAGAC 


ATCACCTACA 


GACTTTACAA 


GACCCTGAGC 


TTAAAACTCC 


1451 


ACGAGGCAGA 


TCTGGAAAAC 


GTGTTCTACA 


AGATAGAAAT 


^4 #^ Ml Ml Ml 

GCCCCTTGTG 


1501 


AACGTGCTTG 


CACGGATGGA 


ACTGAACGGT 


GTGTATGTGG 


ACACAGAGTT 


T C C -1 

1551 


CCTGAAGAAA 


CTCTCAGAAG 


AGTACGGAAA 


AAAACTCGAA 


GAACTGGCAG 


•4 ^ ft 
1601 


AGGAAATATA 


CAGGATAGCT 


GGAGAGCCGT 


TCAACATAAA 


CTCACCGAAG 


1651 


CAGGTTTCAA 


GGATCCTTTT 


TGAAAAACTC 


GGCATAAAAC 


CACGTGGTAA 


1701 


AACGACGAAA 


ACGGGAGACT 


ATTCAACACG 


^ Ml V % % Ml 

CATAGAAGTC 


CTCGAGGAAC 


Tien 

1751 


TTGCCGGTGA 


ACACGAAATC 


ATTCCTCTGA 


Ml Ml m Ml Jk H Ml 

TTCTTGAATA 


CAGAAAGATA 


loOl 


CAGAAAiTGA 


AATCAACCTA 


CATAGACGCT 


CTTCCCAAGA 


TGGTCAACCC 


1851 


AAAGACCGGA 


AGGATTCATG 


CTTCTTTCAA 


TCAAACGGGG 


HL Ml % Ml 

ACTGCCACTG 


1 y u 1 


oAAGALTTAG 


CAGCAGCGAT 


CCCAATCTTC 


AGAACCTCCC 


GACGAAAAGT 




VjAAbAGbGAA 


AAGAAATCAG 


GAAAGCGATA 


GTTCCTCAGG 


ATCCAAACTG 


o A 1 


GTGGATCGTC 


AGTGCCGACT 


ACTCCCArti\T 


AGAACTGAGG 


ATCCTCGCCC 


o r» Q 1 
ZU51 


A 1 UTC AG A GG 


TGATGAGAAT 


CTTTTGAGGG 


CATTCGAAGA 


GGGCATCGAC 




GTCCACACTC 


TAACAGCTTC 


CAGAATATTC 


AACGTGAAAC 


CCGAAGAAGT 


2151 


AACCGAAGAA 


ATGCGCCGCG 


M^ ^ ^ ^ ^ Ml 

CTGGTAAAAT 


^^^^MiMi%. ^mmm^n 

GGTTAATTTT 


m Mi^N m Ml m m\ 

TCCATCATAT 


^ #-> ft "1 

2201 


ACGGTGTAAC 


ACCTTACGGT 


CTGTCTGTGA 


GGCTTGGAGT 


ACCTGTGAAA 


2251 


GAAGCAGAAA 


AGATGATCGT 


CAACTACTTC 


GTCCTCTACC 


CAAAGGTGCG 


2301 


CGATTACATT 


CAGAGGGTCG 


TATCGGAAGC 


GAAAGAAAAA 


GGCTATGTTA 


2351 


GAACGCTGTT 


TGGAAGAAAA 


AGAGACATAC 


CACAGCTCAT 


GGCCCGGGAC 


2401 


AGGAACACAC 


AGGCTGAAGG 


AGAACGAATT 


GCCATAAACA 


CTCCCATACA 


2451 


GGGTACAGCA 


GCGGATATAA 


TAAAGCTGGC 


TATGATAGAA 


ATAGACAGGG 


2501 


AACTGAAAGA 


AAGA^JJ^TG 


AGATCGAAGA 


TGATCATACA 


GGTCCACGAC 



I 

I 
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2551 GAACTGGTTT TTGAAGTGCC CAATGAGGAA AAGGACGCGC TCGTCGAGCT 
2601 GGTGAAAGAC AGAATGACGA ATGTGGTAAA GCTTTCAGTG CCGCTCGAAG 
2651 TGGATGTAAC CATCGGCAAA ACATGOTCGT GA 

5 1 6 . A recombinant DN A vector that comprises the DNA sequence of claim 1 3. 

17. A recombinant host cell transforaied with the vector of claim 16. 

1 8 . The recombinant host cell of claim 17 that is E. sqH. 



10 



19. llie enzyme of claim 1, having a molecular weight about 86 kilodaltons, 
that is in recombinant form. 



20. The enzyme of claim 1 , having a molecular weight about 70 kilodaltons, 
15 that is in recombinant fomi. 

21 . The DNA sequence of claim 13 that encodes aiTiino acids number 140 to 
893 ofSEQ ID N0:1. 

20 22. The DNA sequence of claim 21 that is nucleotides number 418 to 2682 of 

SEQIDNO: I. 

23. The DNA sequence of claim 13 that encodes amino acids number 284 to 
893ofSEQIDNO:l. 

25 

24. The DNA sequence of claim 23 that is nucleotides number 850 to 2682 of 
SEQIDNO: 1. 
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